Data Sources & Attribution
Every fact, figure, and entity on 4ort.xyz comes from open data sources we cite below. We blend CC0 datasets with US Federal public-domain APIs to build a knowledge graph for AI agents. No scraped private content, no proprietary feeds, no licensing surprises.
Architectural principle:
Wikidata is our source-of-truth for entities. Federal APIs (BLS, BEA, Census,
EIA, Treasury, FRB) supply economic indicators. SEC EDGAR provides corporate filings.
Wikimedia provides imagery. Each layer is open or government public domain โ every
redistribution is legally clean.
โญ Pure CC0 / Public Domain (no attribution required)
Wikidata CC0 1.0 โ wikidata.org
Use: Entity facts, relationships, claims, qualifiers, sitelinks, aliases,
descriptions, multilingual labels, P279 class hierarchy. The structural backbone of the entire knowledge graph.
Wikipedia Pageviews CC0 โ dumps.wikimedia.org
Use: Hourly Wikipedia pageview data โ the "trending now" signal,
Popularity Graph score's pageview component, /trending feed.
Wikipedia Clickstream CC0 โ dumps.wikimedia.org
Use: Reader navigation graph โ "Readers Also Explored" sidebar
panel, association metrics between entities.
๐๏ธ US Federal Public Domain (no key, free redistribution)
Federal Reserve Board Public Domain โ federalreserve.gov
Use: H.15 Selected Interest Rates โ Federal Funds Rate, mortgage rates,
AAA/BAA corporate bonds, Treasury yields. (Note: distinct from St. Louis Fed's FRED service.)
FRB notice: Information on the Board's website is in the public domain
and may be copied and distributed without permission. Source: Board of Governors of the
Federal Reserve System.
US Treasury Fiscal Data Public Domain โ fiscaldata.treasury.gov
Use: Treasury yields (1mo, 3mo, 6mo, 1y, 5y, 10y, 30y), national debt,
exchange rates. No API key required.
SEC EDGAR Public Domain โ sec.gov/edgar
Use: Company tickers + CIKs (Phase 1), recent filings 8-K/10-K/10-Q/Form 4
(Phase 2), XBRL financials (Phase 3). Powers SEC EDGAR sidebar block on every public-company entity page.
USDA FoodData Central Public Domain โ fdc.nal.usda.gov
Use: 389,000 branded food products with full nutrition labels (calories,
macros, micros, serving size, ingredients). Ingested as first-class entities โ every food
product gets its own entity page with USDA-sourced facts.
openFDA NDC Drug Directory Public Domain โ open.fda.gov
Use: 39,000 FDA-approved drugs with NDC codes, active ingredients, dosage forms,
manufacturer/labeler, marketing status. Drug entities ingested with structured product data
and cross-referenced against Wikidata where possible.
NHTSA vPIC Vehicle Catalog Public Domain โ vpic.nhtsa.dot.gov
Use: 42,000 vehicle models from the federal Vehicle Product Information
Catalog โ make, model, year, body type, drivetrain. Powers vehicle entity pages.
CIA World Factbook Public Domain โ cia.gov/the-world-factbook
Use: Country reference data for ~250 nations โ government structure,
economy (GDP composition, exports, currency), people (population, life expectancy,
literacy), geography. Renders on country entity pages as a structured Factbook panel.
๐ Federal APIs (require attribution disclaimers)
U.S. Bureau of Labor Statistics (BLS) Federal API โ bls.gov
Use: CPI inflation, unemployment rate, jobs report, employment cost,
productivity. Foundation of /economy macro indicators.
BLS-required disclaimer: "BLS.gov cannot vouch for the data or analyses
derived from these data after the data have been retrieved from BLS.gov."
U.S. Bureau of Economic Analysis (BEA) Federal API โ bea.gov
Use: GDP, GNP, personal income, savings rate, trade balance,
regional economic data.
BEA-required disclaimer: "This product uses the Bureau of Economic
Analysis (BEA) Data API but is not endorsed or certified by BEA."
U.S. Census Bureau Federal API โ census.gov
Use: Aggregate retail sales, housing starts, ACS demographics,
industry-level business statistics. Used in aggregate form only โ never combined
to identify individuals (per Census re-identification rule).
Census-required disclaimer: "This product uses the Census Bureau Data API
but is not endorsed or certified by the Census Bureau."
U.S. Energy Information Administration (EIA) Federal API โ eia.gov
Use: Petroleum spot prices, natural gas, electricity, gasoline retail,
refined-products markets. Source: U.S. Energy Information Administration.
๐ผ๏ธ Imagery
Wikimedia Commons (Public Domain only) PD-filtered โ commons.wikimedia.org
Use: Hero images on entity pages. ~149,000 PD images currently active;
~28,000 non-PD images were purged for commercial-API safety and queued for re-harvest with PD-only criteria.
๐ Web Authority Signals (free with attribution, commercial OK)
DomCop OpenPageRank Free + Attribution โ Open PageRank
Use: Top 10,000,000 domains ranked by Open PageRank score
(a PageRank-style authority metric derived from the Common Crawl link graph).
Powers our search-result ranking layer โ surfacing authoritative sources first
when an AI agent queries for an entity. Refreshed monthly.
๐ผ Contract / Paid Sources
DataForSEO Paid Contract โ dataforseo.com
Use: Google search volume, keyword difficulty, CPC, search intent for ~43,000 entities.
Powers the "๐ Search Volume" sidebar block.
๐ Global CC0 Reference Data (no attribution required)
GLEIF / Legal Entity Identifiers CC0 1.0 โ gleif.org
Use: 2.5M+ Legal Entity Identifiers for corporate entities globally,
including parent/subsidiary hierarchy + ISIN, BIC, MIC mappings. Refreshed daily.
Used to canonicalize companies across jurisdictions and link to financial-market identifiers.
๐ฎ Coming Soon (planned ingest, all CC0/PD)
MusicBrainz CC0
Music metadata โ track-level credits, releases, labels for 2M+ artists.
Open Library CC0
30M+ book records with ISBN, editions, cover images, author bibliographies.
OpenAlex CC0
Academic citation graph โ papers, authors, h-index, affiliations.
ROR (Research Organization Registry) CC0
Canonical IDs for 110k research institutions worldwide.
ORCID CC0 Public File
17M+ researcher IDs and affiliations.
PubMed/MEDLINE Public Domain
35M+ biomedical papers and clinical research records.
ClinicalTrials.gov Public Domain
500k+ clinical trial records, daily-refreshed.
USPTO PatentsView Public Domain
Patents linked to inventors and assignee entities.
Natural Earth Public Domain
Country and region boundary maps in pure public domain โ replaces non-PD geo basemaps.
๐ฏ What we DON'T use
No scraped Wikipedia article text (we use Wikidata's structured data only). No FRED (terms prohibit AI use; we use Federal Reserve Board sources directly instead). No proprietary financial feeds. No private surveillance data. No gated APIs that conflict with our open-data redistribution model.