RxDataLab Data Documentation / FDA Drug Exclusivity
FDA Drug Exclusivity
RxDataLab's harmonized, AI-ready FDA exclusivity dataset combines the FDA Orange Book, USPTO patent records, and openFDA drug labels into a single relational database enriched with change history, computed LOE estimates, and normalized company identifiers. Intended for analysts, strategy teams, and researchers who value traceable primary source data.
Drug market exclusivity in the United States is governed by two overlapping but legally distinct systems. Patent protection is a property right granted by the US Patent and Trademark Office (USPTO) that covers the underlying compound, formulation, or method of use. Regulatory exclusivity is a separate set of market protections granted directly by the FDA under the Hatch-Waxman Act (for small molecule drugs) and the Public Health Service Act (for biologics); it delays generic or biosimilar approval regardless of patent status. The two systems interact in practice. A drug may lose patent coverage while retaining regulatory exclusivity, or vice versa and the effective barrier to generic entry persists until both have expired. When we refer to estimated “loss of exclusivity” (LOE) in this document, we mean the date on which both sets of protections have likely lapsed and competitor generics can legally enter the market. Exclusivity is complex, and here we provide only estimates based on available information provided by the FDA and USPTO.
For small molecule drugs regulated under the Federal Food, Drug, and Cosmetic (FD&C) Act, all relevant patents and patent claims must be listed in an FDA publication called the Orange Book. The Orange Book is a primary reference for generic manufacturers, competitive intelligence teams, business development professionals, investors, and patients tracking when branded drugs face competition. The Orange Book is updated continuously with daily incremental changes posted to the FDA website, monthly data releases published as the Electronic Orange Book download, and a full annual edition is released each year. All data is freely available.
To make this data maximally useful for analysis, RxDataLab has built a comprehensive FDA Exclusivity dataset that integrates primary sources, normalizes and enriches the raw records, tracks changes monthly, and exposes everything in a relational format built for analysts and AI workflows.
Data sources
- FDA Electronic Orange Book is the canonical list of approved NDAs and ANDAs with their associated patent listings and exclusivity grants, updated monthly by FDA (see the FAQ and Guidance for Industry)
- openFDA Structured Product Labeling (SPL) contain structured product labeling records linked to new drug application (NDA) numbers. This provides indication text, pharmacological class, boxed warning flags, and important keys/link to other datasets including RxCUI, UNII, and NDC
- USPTO PatentsView is an aggregated grant-level patent dataset including details like (grant date, expiration, title, assignee, application number and more
- RxDataLab Company Enrichment for a subset of companies, RxDataLab provides harmonized names (including subsidiaries acquisitions, mergers) and unique company identifiers such as SEC central index keys (CIK) and stock tickers where available
Update frequency: Monthly, following the FDA Orange Book release cycle (typically around the 25th of each month).
Access: Available as a one-time export or annual data license with monthly updates. See pricing and licensing →
Provenance, Transparency, and AI-Ready Data
RxDataLab does not resell or license third-part data vendor feeds. Our pipeline pulls directly from the FDA, USPTO, and other regulatory sources ensuring data lineage is maintained and records link back to their original source wherever possible. Derived or inferred data have explicit column markings (for example loe_estimates and first generic entry) and the methodology for computing those tables is documented.
AI-ready format
The dataset is delivered as a normalized SQLite database with a defined schema, AGENTS.md file explaining use and caveats, and a set of example queries to get started. The definitions and examples are intended to make it easy for an LLM-based tool to quickly gain necessary context to assist analysts when querying the data. The schema reference includes worked queries for major use cases.
FDA Exclusivity Database Overview
The dataset is organized into five areas:
Applications and products — one row per NDA/ANDA in applications, one row per strength/form/route combination in products. Company names are normalized to a canonical rxdl_company identifier with stock ticker and SEC CIK, enabling joins to other RxDataLab datasets.
Patents — patents listed in the Orange Book, with expiry dates and exclusivity flags. The patent_details table joins to USPTO for grant date, patent title, patent application number, and assignee, which can be useful when tracking patent prosecution.
Exclusivity — FDA statutory exclusivity grants with codes and expiration dates. NCE (5-year new chemical entity), ODE (7-year orphan drug), NPE (3-year new clinical study), GAIN (Generating antibiotic incentives now, 5 year extension) and pediatric (6-month additive) are the most analytically significant. The exclusivity_codes table provides plain-English definitions for all codes.
LOE estimates — the loe_estimates table computes a single expected loss-of-exclusivity date per application: MAX(latest patent expiry + pediatric adjustment, latest exclusivity expiration). This number is often used when building generic entry timelines. Full methodology, with important caveats →
Change history — RxDataLab tracks changes between monthly orangebook releases. We primarily monitor modification and deletion of patents and exclusivity records and store it as a monthly event log. This allows you to see when a patent expiry date was quietly extended, when a new exclusivity was granted, or when an application disappeared from the Orange Book entirely. Change tracking currently begins in October 2023, but RxDataLab is continually working to improve and extend the history.
Tables
| Table | Description |
|---|---|
applications | One row per NDA/ANDA/BLA — top-level approval with company and application metadata |
products | Drug products (strength, form, route) under each application |
patents | Orange Book patent listings with expiry dates and pediatric flag |
patent_details | USPTO grant details joined to patent listings |
exclusivity | FDA exclusivity grants with codes and expiration dates |
exclusivity_codes | Reference: code definitions in plain English |
loe_estimates | Computed loss-of-exclusivity date per application |
drug_labels | openFDA SPL label data joined by NDA number |
*_changes | Monthly change log tables for each core table |