STARR is Stanford’s single integrated data lake, containing clinical data of different modalities from the three Stanford Hospitals and their associated clinics. STARR contains both “raw” and “analysis-ready” data, as well as tools to analyze these data. The Electronic Health Records (EHR) data in the OMOP Common Data Model, together with the HIPAA-compliant Big Data computing platform, provide the framework for our Clinical Data Science infrastructure at Stanford. This presentation gives an overview of the data lake and the computing platform, along with the challenges and opportunities in our journey so far.
Watch the Recording