Data Sources & Attribution

Every fact, figure, and entity on 4ort.xyz comes from open data sources we cite below. We blend CC0 datasets with US Federal public-domain APIs to build a knowledge graph for AI agents. No scraped private content, no proprietary feeds, no licensing surprises.

Architectural principle: Wikidata is our source-of-truth for entities. Federal APIs (BLS, BEA, Census, EIA, Treasury, FRB) supply economic indicators. SEC EDGAR provides corporate filings. Wikimedia provides imagery. Each layer is open or government public domain โ€” every redistribution is legally clean.

โญ Pure CC0 / Public Domain (no attribution required)

Wikidata CC0 1.0 โ†’ wikidata.org

License: CC0 1.0 Universal ยท Operator: Wikimedia Foundation
Use: Entity facts, relationships, claims, qualifiers, sitelinks, aliases, descriptions, multilingual labels, P279 class hierarchy. The structural backbone of the entire knowledge graph.

Wikipedia Pageviews CC0 โ†’ dumps.wikimedia.org

License: CC0 ยท Operator: Wikimedia Foundation
Use: Hourly Wikipedia pageview data โ€” the "trending now" signal, Popularity Graph score's pageview component, /trending feed.

Wikipedia Clickstream CC0 โ†’ dumps.wikimedia.org

License: CC0 ยท Operator: Wikimedia Foundation
Use: Reader navigation graph โ€” "Readers Also Explored" sidebar panel, association metrics between entities.

๐Ÿ›๏ธ US Federal Public Domain (no key, free redistribution)

Federal Reserve Board Public Domain โ†’ federalreserve.gov

License: US Federal works โ€” public domain (17 USC ยง105)
Use: H.15 Selected Interest Rates โ€” Federal Funds Rate, mortgage rates, AAA/BAA corporate bonds, Treasury yields. (Note: distinct from St. Louis Fed's FRED service.)
FRB notice: Information on the Board's website is in the public domain and may be copied and distributed without permission. Source: Board of Governors of the Federal Reserve System.

US Treasury Fiscal Data Public Domain โ†’ fiscaldata.treasury.gov

License: US Federal works โ€” public domain (17 USC ยง105)
Use: Treasury yields (1mo, 3mo, 6mo, 1y, 5y, 10y, 30y), national debt, exchange rates. No API key required.

SEC EDGAR Public Domain โ†’ sec.gov/edgar

License: US Federal works โ€” public domain (17 USC ยง105)
Use: Company tickers + CIKs (Phase 1), recent filings 8-K/10-K/10-Q/Form 4 (Phase 2), XBRL financials (Phase 3). Powers SEC EDGAR sidebar block on every public-company entity page.

USDA FoodData Central Public Domain โ†’ fdc.nal.usda.gov

License: US Federal works โ€” public domain (17 USC ยง105)
Use: 389,000 branded food products with full nutrition labels (calories, macros, micros, serving size, ingredients). Ingested as first-class entities โ€” every food product gets its own entity page with USDA-sourced facts.

openFDA NDC Drug Directory Public Domain โ†’ open.fda.gov

License: US Federal works โ€” public domain. Free API.
Use: 39,000 FDA-approved drugs with NDC codes, active ingredients, dosage forms, manufacturer/labeler, marketing status. Drug entities ingested with structured product data and cross-referenced against Wikidata where possible.

NHTSA vPIC Vehicle Catalog Public Domain โ†’ vpic.nhtsa.dot.gov

License: US Federal works โ€” public domain. Free API.
Use: 42,000 vehicle models from the federal Vehicle Product Information Catalog โ€” make, model, year, body type, drivetrain. Powers vehicle entity pages.

CIA World Factbook Public Domain โ†’ cia.gov/the-world-factbook

License: US Federal works โ€” public domain (17 USC ยง105)
Use: Country reference data for ~250 nations โ€” government structure, economy (GDP composition, exports, currency), people (population, life expectancy, literacy), geography. Renders on country entity pages as a structured Factbook panel.

๐Ÿ“Š Federal APIs (require attribution disclaimers)

U.S. Bureau of Labor Statistics (BLS) Federal API โ†’ bls.gov

License: US Federal works โ€” public domain. Free API key required.
Use: CPI inflation, unemployment rate, jobs report, employment cost, productivity. Foundation of /economy macro indicators.
BLS-required disclaimer: "BLS.gov cannot vouch for the data or analyses derived from these data after the data have been retrieved from BLS.gov."

U.S. Bureau of Economic Analysis (BEA) Federal API โ†’ bea.gov

License: US Federal works โ€” public domain. Free API key required.
Use: GDP, GNP, personal income, savings rate, trade balance, regional economic data.
BEA-required disclaimer: "This product uses the Bureau of Economic Analysis (BEA) Data API but is not endorsed or certified by BEA."

U.S. Census Bureau Federal API โ†’ census.gov

License: US Federal works โ€” public domain. Free API key required.
Use: Aggregate retail sales, housing starts, ACS demographics, industry-level business statistics. Used in aggregate form only โ€” never combined to identify individuals (per Census re-identification rule).
Census-required disclaimer: "This product uses the Census Bureau Data API but is not endorsed or certified by the Census Bureau."

U.S. Energy Information Administration (EIA) Federal API โ†’ eia.gov

License: US Federal works โ€” public domain. Free API key required.
Use: Petroleum spot prices, natural gas, electricity, gasoline retail, refined-products markets. Source: U.S. Energy Information Administration.

๐Ÿ–ผ๏ธ Imagery

Wikimedia Commons (Public Domain only) PD-filtered โ†’ commons.wikimedia.org

License: We display ONLY public-domain files. CC-BY/CC-BY-SA/nominative-use files purged from our index.
Use: Hero images on entity pages. ~149,000 PD images currently active; ~28,000 non-PD images were purged for commercial-API safety and queued for re-harvest with PD-only criteria.

๐Ÿ”— Web Authority Signals (free with attribution, commercial OK)

DomCop OpenPageRank Free + Attribution โ†’ Open PageRank

License: Free for commercial use with attribution per DomCop's terms. Storage on our servers explicitly permitted.
Use: Top 10,000,000 domains ranked by Open PageRank score (a PageRank-style authority metric derived from the Common Crawl link graph). Powers our search-result ranking layer โ€” surfacing authoritative sources first when an AI agent queries for an entity. Refreshed monthly.

๐Ÿ’ผ Contract / Paid Sources

DataForSEO Paid Contract โ†’ dataforseo.com

License: Commercial contract
Use: Google search volume, keyword difficulty, CPC, search intent for ~43,000 entities. Powers the "๐Ÿ“Š Search Volume" sidebar block.

๐ŸŒ Global CC0 Reference Data (no attribution required)

GLEIF / Legal Entity Identifiers CC0 1.0 โ†’ gleif.org

License: CC0 1.0 ยท Operator: Global Legal Entity Identifier Foundation
Use: 2.5M+ Legal Entity Identifiers for corporate entities globally, including parent/subsidiary hierarchy + ISIN, BIC, MIC mappings. Refreshed daily. Used to canonicalize companies across jurisdictions and link to financial-market identifiers.

๐Ÿ”ฎ Coming Soon (planned ingest, all CC0/PD)

MusicBrainz CC0

Music metadata โ€” track-level credits, releases, labels for 2M+ artists.

Open Library CC0

30M+ book records with ISBN, editions, cover images, author bibliographies.

OpenAlex CC0

Academic citation graph โ€” papers, authors, h-index, affiliations.

ROR (Research Organization Registry) CC0

Canonical IDs for 110k research institutions worldwide.

ORCID CC0 Public File

17M+ researcher IDs and affiliations.

PubMed/MEDLINE Public Domain

35M+ biomedical papers and clinical research records.

ClinicalTrials.gov Public Domain

500k+ clinical trial records, daily-refreshed.

USPTO PatentsView Public Domain

Patents linked to inventors and assignee entities.

Natural Earth Public Domain

Country and region boundary maps in pure public domain โ€” replaces non-PD geo basemaps.

๐ŸŽฏ What we DON'T use

No scraped Wikipedia article text (we use Wikidata's structured data only). No FRED (terms prohibit AI use; we use Federal Reserve Board sources directly instead). No proprietary financial feeds. No private surveillance data. No gated APIs that conflict with our open-data redistribution model.