AI Engineering 12 min read

Cloud in Healthcare: Architecture Decisions, Compliance Requirements, and the Engineering Reality of HIPAA, FHIR, and AI-Cleared Medical Software

The global healthcare cloud market is $63.55 billion in 2025, projected at $74.57 billion in 2026. The FDA cleared 295 AI/ML medical devices in 2025 alone. Epic runs 65 million references per second on Azure. SEPA-equivalent regulatory deadlines — CMS Prior Authorization API mandates take effect January 2026. Here is the engineering architecture behind compliant healthcare cloud systems.

Cloud in Healthcare: Architecture Decisions, Compliance Requirements, and the Engineering Reality of HIPAA, FHIR, and AI-Cleared Medical Software

The global healthcare cloud computing market was valued at $63.55 billion in 2025, growing from $54.28 billion in 2024. It is projected to reach $74.57 billion in 2026 and expand to $251–$313 billion by 2034–2035, at a 16–17.7% CAGR depending on the study. North America holds 41% of the global market.

The growth trajectory is driven by multiple compounding forces: the consolidation of healthcare AI workloads into cloud-hosted inference pipelines; regulatory mandates for open API interoperability (CMS-0057-F); the EHR vendor ecosystem’s migration to cloud-hosted deployment models; and the ongoing capital efficiency argument for health systems that would rather not own data centers.

What makes healthcare cloud engineering categorically different from standard cloud engineering is the regulatory and compliance overlay: HIPAA Business Associate Agreements (BAAs), FHIR R4 interoperability requirements, FDA clearance requirements for AI features used in clinical workflows, and an enforcement environment for information blocking that carries civil monetary penalties up to $1 million per violation. These requirements do not change the fundamental cloud engineering patterns — they add mandatory constraints at every layer of the stack.


The HIPAA Compliance Layer: What It Actually Requires

HIPAA does not mandate specific technology choices. It mandates administrative, physical, and technical safeguards for Protected Health Information (PHI). The practical implications for engineering teams:

Business Associate Agreements (BAAs) are the entry point. Every cloud vendor that creates, receives, maintains, or transmits PHI must sign a BAA before a health system can use the service. AWS, Microsoft Azure, and Google Cloud all offer BAAs for their core services, but the BAA coverage does not extend automatically to every managed service those platforms offer. Teams must audit which specific services are in-scope for the BAA and which are not — a surprise exclusion for a logging or analytics service creates a HIPAA exposure.

Encryption requirements: PHI must be encrypted at rest and in transit. This is table stakes for any cloud deployment — TLS 1.2+ for transit, AES-256 for at-rest encryption — and all three major cloud providers default to this. The more nuanced requirement is key management: HIPAA auditors increasingly expect customer-managed encryption keys (CMK) with documented key rotation policies for PHI datastores, rather than provider-managed defaults.

Audit logging: The HIPAA Security Rule requires audit logs of access to PHI — who accessed what record, when, from which system. CloudTrail (AWS), Azure Monitor (Activity Logs), and Google Cloud Audit Logs satisfy this requirement for control plane operations. Application-level access logging — tracking which user within a health system application viewed which patient record — must be implemented at the application layer and requires explicit engineering investment.

Data residency: HIPAA itself does not mandate US-only data storage — it requires safeguards, not geography. However, health systems routinely negotiate contractual data residency clauses into BAAs for risk management purposes. Organizations handling EU patient data face GDPR data transfer restrictions on top of HIPAA. Engineering teams working with multi-national health systems must handle this at the data classification and routing layer, not just at the procurement layer.

The February 2024 Change Healthcare breach — the largest healthcare data breach in US history — is the canonical risk case. ALPHV/BlackCat ransomware gained access through a Citrix remote access portal with no multi-factor authentication. Nine days of undetected dwell time preceded the breach. MFA enforcement on every system that touches PHI, combined with network segmentation that limits lateral movement from a compromised credential, is the minimum 2025 security baseline for healthcare cloud infrastructure.


Cloud Provider Selection: How the Major Platforms Differentiate in Healthcare

AWS HealthLake is an FHIR R4-compliant managed data store with integrated NLP via Amazon Comprehend Medical — it automatically extracts medical entities (conditions, medications, diagnoses, procedures) from unstructured text inside FHIR DocumentReference resources. AWS was named Best in KLAS for Public Cloud in 2025.

Two production benchmarks: MHK (a managed healthcare company) accelerated its payer interoperability work by nine months using HealthLake. Greenway Health achieved transaction speeds exceeding 8,000 per second on the platform. AWS has emerged as the preferred platform for operational AI agents — scheduling automation, prior authorization processing, and patient communication systems.

Microsoft Azure Health Data Services unifies FHIR R4, DICOM, and MedTech (IoT) under a single managed umbrella. The marquee differentiator is the Epic-Azure partnership: Microsoft expanded Azure capacity for Epic’s Chronicles Operational Database to 65 million global references per second — a 171% increase from 2023 — using the Mbv3 VM series. Mount Sinai Health System was the first to deploy Epic on Azure Large Instances.

DAX Copilot, Microsoft’s ambient clinical documentation product, is embedded directly into Epic via Azure OpenAI Service, creating a tight integration between Microsoft’s AI infrastructure and the dominant EHR platform. FHIR data synchronizes automatically to Microsoft Fabric’s OneLake for unified analytics. Azure is the natural fit for organizations deep in Microsoft 365 and Azure AD ecosystems.

Google Cloud Healthcare API is unique in supporting FHIR R4, DICOM, and HL7 v2 message storage natively — the only major provider with native HL7 v2 store, which remains the dominant message format for lab results and ADT notifications from legacy hospital systems. MedLM, Google’s family of healthcare-fine-tuned foundation models on Vertex AI, is being tested by HCA Healthcare and Accenture for medical documentation and health-plan chatbots. Google Cloud is the preferred platform for population health analytics and clinical research — the BigQuery + Healthcare API pipeline creates the tightest path from raw FHIR data to machine learning.

The emerging market segmentation as of 2025: Microsoft for ambient clinical documentation; AWS for patient-facing operational agents; Google for analytics, research, and AI model development.


FHIR R4: The Technical Standard That Is Reshaping Integration

FHIR R4 (Fast Healthcare Interoperability Resources, Release 4) is the regulatory and technical standard that the ONC 21st Century Cures Act mandated for certified EHR vendors. Understanding it as an engineering standard rather than a regulatory checkbox matters:

Oracle Health (Cerner) deprecated FHIR DSTU2 by December 2025, making FHIR R4 the required integration standard. Oracle’s Ignite APIs provide RESTful R4 endpoints, bulk exports, and SMART app launches.

Epic fully supports FHIR R4 with read/search across most US Core profiles and a growing write subset. Epic’s marketplace hosts 790 apps, including 344 new listings since 2024. The SMART on FHIR protocol — which defines how third-party apps authenticate against an EHR and access patient context — is the standard for clinical application integration and is actively supported by Meditech, NextGen, and Nextech alongside Epic and Cerner.

Bulk FHIR (FHIR Bulk Data Access specification) enables population-level data export — the mechanism by which a health system extracts data for analytics, research, or payer interoperability without building custom ETL pipelines per data element. The CMS-0057-F Payer-to-Payer API (effective January 2027) requires Bulk FHIR as the exchange mechanism.

For engineering teams building healthcare integrations: FHIR R4 means RESTful API patterns, JSON or XML resource representations, OAuth 2.0 + SMART for authentication, and a defined resource model (Patient, Encounter, Observation, Condition, MedicationRequest) for common clinical concepts. The complexity is in the gaps: resource profiles (US Core constrains FHIR to specific field requirements), terminology mapping (SNOMED CT, LOINC, ICD-10, RxNorm), and the practical reality that two Epic installations at different health systems may expose slightly different FHIR profiles based on their configuration and data governance choices.


CMS Interoperability Mandates: The Engineering Deadlines

The CMS Interoperability and Prior Authorization Final Rule (CMS-0057-F, January 17, 2024) mandates four new APIs for payers:

  1. Patient Access API — patients can access their claims, clinical, and formulary data via FHIR R4
  2. Provider Access API — providers can access patient data from payers to support care coordination
  3. Payer-to-Payer API — payers must share patient data when a member changes plans
  4. Prior Authorization API — payers must process prior authorization requests digitally and return decisions within defined timeframes

The engineering deadline structure: the Prior Authorization digital exchange rules take effect January 1, 2026, with all four APIs required to be live by January 1, 2027. The projected savings from this mandate is $15 billion over 10 years — driven primarily by reduction in administrative overhead for prior authorization processing, which currently consumes approximately 2 hours of physician time per week and 16 hours of nursing time per week at a typical US practice.

For health IT vendors and payers who have not yet started implementation: the January 2026 deadline is not aspirational. Non-compliant systems face CMS enforcement actions, and healthcare providers whose payers are not compliant face downstream disruption to their prior authorization workflows.


Healthcare AI: FDA-Cleared Devices and the Production Architecture

By the end of 2025, the FDA had cleared 1,451 cumulative AI/ML-enabled medical devices. In 2025 alone, the FDA granted clearance to 295 AI/ML devices — approximately one new clearance per business day. Radiology dominates: approximately 956 of all cleared devices are radiology tools (imaging analysis, triage, measurement automation). 62% are classified as Software as a Medical Device (SaMD), meaning cloud-hosted, software-only tools rather than physical hardware. 97% were cleared via the 510(k) pathway, leveraging predicates rather than full clinical trials.

Almost all cleared AI devices are decision-support tools — flagging findings for clinician review — rather than autonomous diagnosticians. The FDA’s framework explicitly constrains autonomy: the cleared devices assist trained clinicians in making decisions, they do not replace those decisions. Engineering teams building healthcare AI features should plan for this constraint at the product design layer, not as an afterthought.

The cloud architecture that supports FDA-cleared SaMD: immutable audit logs of model inference (what input produced what output, with version tracking for the model), human-in-the-loop workflow integration (the AI output feeds into a clinician review queue, not directly into clinical action), model performance monitoring over time (drift detection to flag when production data distribution diverges from training data), and documentation systems that support 510(k) submission or De Novo request requirements.


The Build-vs-Buy Decision in Healthcare Engineering

The healthcare technology stack has a higher proportion of vendor solutions than most other industries, for a defensible reason: building HIPAA-compliant data infrastructure, FHIR-native EHR integrations, and FDA clearance documentation from scratch is expensive relative to the value it adds above the infrastructure layer. The engineering decisions with genuine build-versus-buy optionality are at the application layer — the specific workflows, decision logic, user experience, and AI features that differentiate the product.

70% of US health systems and hospitals are planning to implement cloud-based supply chain solutions by 2026, according to a GHX survey. The implementation work — cloud architecture, EHR integration, FHIR mapping, HIPAA compliance engineering, AI pipeline development — is where specialized engineering expertise creates measurable value.

Insoftex works with healthcare technology companies on cloud architecture, FHIR integration engineering, HIPAA compliance infrastructure, and healthcare AI product development. Book a 30-min technical call to discuss your healthcare engineering requirements.

Let's talk about your AI roadmap.

We work with funded SaaS companies and regulated enterprises building AI that ships — not AI that demos.

Press Esc to close