All Insights

Beyond Compliance: Designing a FHIR Data Architecture Built to Scale

FHIR-Data-Architecture
4 min read

FHIR compliance and FHIR architecture are not the same problem. A FHIR architecture designed only to satisfy current regulatory requirements will require significant rework when analytics, AI, additional use cases, or R5 migration become priorities. The cost of architectural debt in FHIR is not paid at go-live—it is paid when the organization tries to expand what FHIR enables. This article covers the architectural decisions that determine whether a FHIR implementation is a compliance checkbox or a durable platform for clinical and operational capability.

FHIR server selection is the most consequential architectural decision in a FHIR implementation. The FHIR server defines what operations are supported, what performance characteristics are achievable, what implementation guides can be supported, and what the long-term operational cost will be.

FHIR Server OptionStrengthsLimitationsBest Fit
HAPI FHIR (open source)Full R4/R5 support, active community, no licensing costRequires internal expertise to operate and tuneOrganizations with Java engineering capacity and ops maturity
Smile Digital HealthEnterprise support, implementation guide tooling, strong profiling supportLicensing cost, less flexible for custom extensionsMid-to-large health systems needing enterprise support
Azure Health Data ServicesManaged cloud, HIPAA BAA, integrated with Azure ecosystemAzure lock-in, limited flexibility for complex profilingOrganizations standardized on Azure with moderate FHIR complexity
AWS HealthLakeManaged, NLP integration, bulk data supportFHIR R4 feature completeness gaps vs. alternativesAnalytics-first use cases in AWS environments
Google Cloud Healthcare APIFHIR R4, DLP integration, BigQuery connectivityGoogle ecosystem dependency, FHIR support depth variesAnalytics pipelines integrated with BigQuery

FHIR resource profiling constrains and extends base FHIR R4 resources to fit specific organizational requirements. Profiling is not optional for production FHIR—it is the mechanism by which an organization defines what data quality means for each resource type and enforces it throughout the data pipeline.

U.S. Core Data for Interoperability (USCDI) profiles, published by HL7 and ONC, define the minimum data elements required for U.S. regulatory compliance. These profiles are the starting point, not the endpoint. Organizations need organization-specific profiles that extend U.S. Core to cover their specific clinical use cases, EHR data patterns, and downstream consumer requirements.

  • Define profiles before build begins: Profiles created after build are retrofitted, which reduces their effectiveness as quality governance tools.
  • Profile every resource type in the use case scope: Partial profiling creates inconsistent data quality across the dataset.
  • Validate source data against profiles during build: Profile validation in a continuous integration pipeline catches quality issues early.
  • Version profiles and maintain a registry: Profiles evolve as use cases expand. Version control and a discoverable registry are operational necessities.

FHIR resources reference clinical terminologies—LOINC for laboratory observations, SNOMED CT for clinical findings, RxNorm for medications, ICD-10-CM for diagnoses, CPT for procedures. Without a terminology service, each integration maps source codes to standard codes independently, producing inconsistent results that break cross-system queries and analytics.

A terminology service provides a single, authoritative mapping and validation layer for all FHIR resources in the organization. Terminology services include concept lookup, code validation, translation between coding systems, value set management, and subsumption queries that support clinical decision support.

  • FHIR Terminology Server: The FHIR specification includes a terminology service API. Standalone FHIR terminology servers (Ontoserver, HAPI FHIR with terminology support) provide this capability.
  • EHR native terminology: Some EHR platforms provide terminology services that can be queried via FHIR API. Coverage and accuracy vary by EHR and terminology domain.
  • Commercial services: National Library of Medicine’s VSAC, Apelon TDE, and clinical terminology vendors provide terminology services with varying coverage and licensing models.

The highest-value FHIR use cases require longitudinal patient records—a coherent view of a patient’s clinical history across encounters, providers, and time. Building longitudinal patient records from FHIR requires patient matching, deduplication, and temporal ordering that are not provided by the FHIR standard itself.

  • Patient identity management: A Master Patient Index (MPI) or probabilistic patient matching service is required to link patient records across contributing systems. FHIR does not define a patient matching algorithm, though the $match operation provides a standard interface.
  • Encounter and episode linking: FHIR Encounter resources need to be linked across contributing systems to build a coherent clinical timeline.
  • Provenance tracking: The FHIR Provenance resource records the source, actor, and timing of each clinical event. Provenance is required for regulatory compliance in some contexts and for clinical trust in longitudinal data.

FHIR R4 is the current standard. FHIR R5 is published and will be the basis for future U.S. regulatory requirements. AI and machine learning use cases are driving new demands on FHIR architectures—standardized features, bulk data access, and fine-grained temporal data that R4 architectures often do not provide.

  • Build on FHIR R4 with documented deviation points: Know where your implementation deviates from R4 normative behavior so R5 migration planning starts from accurate baseline.
  • Use Bulk Data Export from the start: FHIR Bulk Data Access IG is the path to analytics and AI training data. Architectures that support bulk export are positioned for AI use cases without rebuilding.
  • Separate the FHIR API layer from the clinical data store: A clean separation between the FHIR facade and the underlying data store allows the store to evolve without breaking the API layer.

FAQ

Should we build a FHIR repository or use our EHR as the FHIR source of truth?

This is a fundamental architectural choice. Using the EHR as the FHIR source of truth (querying EHR FHIR APIs directly) minimizes data duplication and synchronization complexity but creates dependency on EHR API performance and availability. A separate FHIR repository provides a controlled, profiled dataset but requires synchronization and introduces latency. Most production architectures use a hybrid—EHR APIs for point-of-care use cases, FHIR repository for analytics and population health.

What is the right granularity for FHIR resource profiling?

Profile at the level of your use case requirements, not at the level of every possible FHIR element. Over-profiling creates maintenance burden without proportional quality benefit. Under-profiling leaves data quality gaps that manifest as clinical errors. The right granularity is the set of constraints that, if violated, would cause a consuming application to produce incorrect clinical output.

How do we handle FHIR data from multiple EHR vendors?

Multi-EHR environments require a normalization layer that translates vendor-specific FHIR extensions, coding variations, and structural quirks into a common organizational profile. This normalization is the most technically complex component of a multi-EHR FHIR architecture and is where vendor-specific experience pays the largest dividend.

FHIR architectures built only to satisfy current requirements age poorly. Architectures built with an explicit view of the 3- to 5-year use case roadmap—analytics, AI, R5, additional integration partners—require more upfront investment in design but deliver substantially lower total cost of evolution. Clinovera’s architecture engagements explicitly address the future-readiness question, not just the compliance question.

Contact Clinovera to discuss your FHIR architecture requirements and roadmap.

July 2026

Start a conversation today