Campus alert status is yellow: For the latest campus alert status, news and resources, visit

Search Close Search
Search Close Search
Page Menu

The UMMS Data Lake

UMMS Data Lake implementation combine data from disparate sources including data from the UMass Memorial Hospital Electronic Health Record System (EPIC), Public Health Data, Patient Registries and Administrative Data. The EPIC system represents the most comprehensive data source available for the research community and comprise a variety of data domains.

The data from clinical systems is routinely added into the Data Lake and is stored in its native format. The data then goes through a data engineering process (ETL) to transform it into a useful format for analysis. The ETL process (Extract, Transform, Load) extracts data from the source systems, applies data quality and consistency standards, integrates data from separate sources, and finally delivers data in the appropriate format for Visualization and Analytics.

Data Lake