Research Compute Environments
This is an accordion element with a series of buttons that open and close related content panels.
Platform X
The SMPH has developed a cloud-based, HIPAA-compliant computing environment called Platform X. Built within Microsoft Azure cloud environment, Platform X provides a safe and scalable data analytics space for SMPH researchers. Platform X isolates the environment for each project, and every research team has dedicated and private data storage and computing resources. Platform X securely delivers research data to the teams’ storage space and creates exclusive, team-dedicated compute instances tailored to the teams’ analytic needs. All data are encrypted at rest and in transit. The necessary computing resources are provided within the environment to conduct data analysis in place, without downloading data. The platform leverages cloud native services as well as Virtual Machines (VMs). Center for Internet Security (CIS) benchmarks are applied to safeguard the computing and data when VMs are in use. Firewalls control all outbound internet access, with extensive centralized logging and auditing. Alerts are generated when potential incidents are identified. Advanced identity governance controls are in place that go beyond what is available on the premise within the SMPH and UW-Madison overall. Disaster recovery plans and data lifecycle management are defined for each project based on specific requirements. Dedicated information security staff are responsible for implementing security and privacy control procedures according to the NIST 800-53 Revision 5.
SMPH has established a generalizable and adaptable ‘Real-World Data Collaborative’ (RWDC) environment within the Platform X using Infrastructure as a Code to support large projects with multi-dimensional data and data linkages. The cloud infrastructure and RWDC include the following components (Figure 1):
- Globus: facilitates secure file transfer from external sites either via web-based uploads or Globus-to-Globus (for large data) transfers. Additionally, Globus stores data transferred to or from Globus. Retains a copy of source data and any data egressed from the platform for inspection/audit.
- Copy Function: creates a working copy of the data ingested via Globus to the Internal Storage Account in the cloud.
- Azure Data Factory: orchestrates the code that processes data in Internal Storage Accounts and integrates those with other data sources and metadata that may exist in the SQL Database.
- Git: enables researchers to maintain their code in a version control system.
- Continuous Integration/Continuous Deployment (CI/CD) Pipelines: deploy code to Azure Data Factory.
- VMs: host a variety of software tools (e.g. R, Python) for researchers to conduct analyses.
- Bastion provides a secure path for project team members to access the VMs within the Platform X using a Remote Desktop Connection.
- File Share: functions like a network where users across VMs share data and files.
- Backup Vault: all data in Storage Accounts, VMs, File Share, and SQL Database are routinely backed up.
- Security Center: monitors the environment for deviations from established control systems for compliance and suspicious activity. Makes recommendations to improve security posture and sends alerts when investigation is required.
This cloud-based infrastructure facilitates efficient and timely data sharing and submission to national repositories and promotes data sharing in compliance with NIH guidelines.
Platform R
SMPH is developing an on-premises High-Performance Computing (HPC) environment called Platform R. Platform R is free to use for SMPH members. Housed within a local secure data center, Platform R provides capabilities and tooling for large dataset analytics. Platform R supports these workloads with managed compute, storage, and networking resources. Platform R’s initial HPC cluster, Sett, is comprised of 40 server nodes, 2408 CPU cores, and 25 TB memory. Sett utilizes Slurm to schedule and manage compute jobs. Shared storage, for short-term scratch use, makes 400TB of physical disk space available through a Ceph distributed storage system running on high-speed NVMe drives. Ethernet over fiber networking uses 25GbE connections between compute nodes and 100GbE within storage nodes. The platform additionally connects to UW-Madison facilities for research data inputs and long-term data storage. These connected systems include ResearchDrive, the UW Biotechnology Center, campus’ S3-compatible Object Storage offering, and Globus. Initially, Platform R was not designed to store electronic protected health information (ePHI), although it is being designed for future HIPAA compliance. The technical architecture of this environment is guided by NIST SP 800-223 IPD. PR will be connected to PX for scalability and on-demand bursting.
Center for High Throughput Computing (CHTC)
The Center for High Throughput Computing (CHTC) supports a variety of scalable computing resources and services at UW-Madison. Among the resources available through CHTC is the High throughput compute (HTC) environment with over 20,000 cores and dozens of GPUs. This resource, which is free and shared, is available to all UW researchers. CHTC also provides access points to utilize the resources across the nation. Alongside the standard access to HTC resources, CHTC offers buy-in options for priority access that may be required by high demand or specific needs of a researcher. CHTC’s Research Facilitation team provides users with tools and guidance to manage and automate compute workflow on their resources.
Clinical Trials Software
This is an accordion element with a series of buttons that open and close related content panels.
Oncore
OnCore is a comprehensive Clinical Trial Management System (CTMS) that allows researchers to use one program to manage all components of clinical research, while maintaining consistency and compliance across the institution. Protocols are the foundation of all OnCore functionality and features. Protocol information must be entered in OnCore before building calendars, creating protocol budgets, enrolling subjects, tracking subject visits, or invoicing sponsors. The PC Console (Protocol Coordinator Console) is the central repository for protocol information. Protocol coordinators track protocol ID numbers, objectives, assigned staff, sponsors, participating institutions, regulatory information, investigational drug and device information, and other details of each research study.
Other key features:
- Subject Management: facilitates the tracking of study participants, including enrollment, visits, and other relevant data.
- Financial Management: helps manage budgets, invoicing, and financial aspects of clinical trials.
- Regulatory Compliance: assists in maintaining compliance with regulatory requirements and standards.
- Data Management: supports data collection, monitoring, and reporting throughout the clinical trial lifecycle.
- Integration: our instance of OnCore is integrated with the following external systems:
- HealthLink EHR (Epic): Patient billing and demographics
- eBinders (Florence): Regulatory binders
- Reporting and Analytics: provides tools for generating reports and analyzing data related to clinical trials.
Florence
Florence eBinders is an electronic system that replaces paper documents and physical binders for human subject research study documents. It is an efficient, 21 CFR Part 11 compliant way to electronically sign, manage, store, and share documents. Features of eBinders include the following:
- Document Management: helps organize and manage essential documents related to clinical trials, ensuring compliance with regulatory requirements.
- Collaboration: facilitates collaboration among study teams, sponsors, and other stakeholders involved in the clinical trial documentation process.
- Workflow Management: supports document workflows, version control, and electronic signatures to ensure document integrity and compliance.
- Regulatory Compliance: maintains compliance with regulatory standards and guidelines related to documentation in clinical trials.
- Site Access: enables secure access for participating sites and sponsors.
- Audit Trail: provides an audit trail of document actions and changes, enhancing transparency and accountability.
- OnCore interoperability: OnCore interfaces with eBinders, making it easier to create new binders and ensure appropriate access credentials are pushed to binders.
- Simple filing: send anything directly to your regulatory binder with the Florence ePrinter tool. This makes it easy for research teams to file PDFs, correspondence, and other important binder contents.
iLab (Agilent Technologies)
iLab (Agilent Technologies) is a cloud-based platform designed to facilitate core facility management and streamline research administration processes within academic and research institutions. Core facilities are shared resources within organizations that provide specialized services, equipment, or expertise to researchers.
Key features of iLab include:
- Core Facility Management: iLab helps manage core facilities by providing tools for scheduling equipment, tracking usage, and managing billing and invoicing.
- Service Requests: Researchers can submit service requests to core facilities through the iLab platform, streamlining the process of accessing specialized services.
- Resource Scheduling: The system allows for the scheduling of equipment, services, and resources within core facilities, helping to avoid conflicts and optimize utilization.
- Billing and Invoicing: iLab facilitates the tracking of resource usage and automates billing processes, helping core facilities manage financial transactions efficiently.
- Reporting and Analytics: The platform includes reporting tools to generate insights into resource usage, financials, and other relevant data.
REDCap
REDCap (Research Electronic Data Capture) is a largely self-service, secure, web-based application used for building and managing data collection forms. REDCap provides data management functionality by allowing the development of electronic instruments and surveys to support data capture for research studies, including those that involve protected health information (PHI). REDCap can also be used to collect data for quality improvement and operational support projects.
REDCap provides:
- An intuitive interface (with optional data validation for data captured on forms and surveys).
- Audit trails for tracking data manipulation and export procedures.
- Export procedures for seamless data downloads to common statistical packages (e.g. SPSS, SAS, Stata, & R).
- Reporting capabilities.
- Procedures for importing data from external sources.
Epic Health Link EHR
UW Health’s Health Link is a secure, web-based application for connecting private and referring physicians, skilled nursing and long term care facilities and their staff to Main Line Health’s electronic medical record. The Health Link tool and the underlying electronic medical records database and system are both developed by Epic Systems Corporation.
Health Link helps to integrate clinical care and research to support patient safety, protocol fidelity, and regulatory compliance. There are several self-service and custom tools available within Health Link to facilitate clinical research workflows such as:
- Assessing study feasibility
- Recruiting new participants
- Tracking enrolled participants
- Entering orders for participants
- Documenting research related care
- External research monitoring
- Communicating securely with research and clinical staff
- Billing for research
Much of Health Link research functionality is dependent on use of the clinical trials management systems, OnCore, which is interfaced with Health Link.
Biobanking Software
OnCore’s biospecimen management module (BSM) provides utilities that allow staff to record specimens that have been collected or received at their organization, to document details about each specimen (when it was collected, from what body site it was collected, etc.), and then to indicate which specimens have been shipped out, either to a lab for analysis or to fulfill a specific request from a principal investigator. The Biospecimen Management module is used to create both banking and correlative protocols to track specimens. Banking protocols record specimens collected for prospective banking studies. Correlative protocols collect specimens associated with specific visits on a subject’s calendar.
Features:
- Specimens Grouped into Cases: Cases are groupings of specimens. In banking protocols, cases are all the specimens collected within one consent record. In correlative protocols, cases are sets of specimens collected during one visit. Specimens are samples collected from a patient. Specimens can be stored, shipped, or divided into smaller samples called aliquots. One specimen that is processed into one or more aliquots is represented in OnCore with parent and child specimens. The original specimen collected from the patient is the parent specimen; the specimens created from the parent are child If child specimens are further divided or processed, additional levels of child specimens can be added in OnCore.
- Print labels for specimens: Labels for physical specimens can be generated by the BSM module and printed to dedicated label printers.
- Assign a storage location to specimens: The BSM module supports the configuration of freezers to store biospecimen. Samples can be tracked within the BSM module to track locations assigned within freezers.
- Specimen Custom Reports: Specimen Custom Reports is a powerful, flexible tool for users to create their own reports for searching and reporting on specimen data in OnCore.
- Requesting and Shipping Banked Specimens: After collecting specimens and storing them at your facility, they might be requested for future research. OnCore’s specimen tools allow you to find specimens that meet specific criteria, process the specimens for shipping, and then indicate that they have been sent to another location. You can track requests, approvals, and shipping details in OnCore.
- Importing Specimens into OnCore: The Specimen Import feature was added to accommodate the workflow when specimens arrive in bulk and large quantities of new specimen records need to be entered into OnCore quickly.
- BSM Case and Specimen Annotations Forms: Additional details about cases and specimens can be recorded in case annotation and specimen annotation forms. Biospecimen annotation forms are highly customizable and can be attached to many different “levels” within your specimen data collection tools. A new annotation form is created for every case in your OnCore environment, and a different form (with different fields) is generated for every specimen. Additional fields are made available when a Specimen Type annotation or Body Site annotation form is added to specimens with the appropriate attributes (such as Specimen Type = Blood, or Body Site = Lung).
SMPH Data Commons
The SMPH Data Commons (SDC) is a HIPAA-compliant cloud-based analytical platform. Hosted on Platform X, SDC is designed to enable UW researchers to connect to data and analytical tools in a secure and efficient way to analyze and visualize data and to collaborate with other researchers within the platform. SDC will contain data from multiple sources including EPIC electronic health records (EHR), disease registries, public databases and genomic data whenever available.
Multidimensional EHR subsets created using broad pre-defined disease phenotypes would be provisioned on SDC based on the individual researcher’s IRB approval status. Data would be available in Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) format to allow the use of available common tools and pipelines. Additionally, the available data will be curated for completeness and accuracy with enhance mapping for UW specific concepts. Apart from such analytical tools, SDC will have commonly used visualization tools like the ATLAS browser and cBioPortal for cancer data. Additionally, data dashboard will be made available to enable users to realize the potential of the data.
Educational Technology support
Technology | Description | Link | Supported By |
Oasis |
Web-based student information system used for course lottery, scheduling, curriculum tracking, and online clinical evaluations. |
SMPH IT |
|
Acuity |
Acuity One45 is the scheduling software used for student course calendars and evaluations in the MD program. |
EdTech |
|
Canvas |
Canvas is a cloud-based, vendor-hosted learning management system (LMS). Based on positive results from pilot studies and campus feedback, Canvas was adopted as the single, centrally supported UW-Madison LMS in 2016. |
Campus & EdTech |
|
Video Library |
The Video Library is a secure repository for Lecture Capture, Enduring Learning Objects, and other educational videos for UW Health Sciences Students. |
EdTech |
|
Studio Capture |
Turn your PowerPoint into a narrated presentation that can be published to the Video Library. All production done at the HSLC in the Educational Technology Studio. |
EdTech |
For more information submit the SMPH informatics consult form