How Does Intrinio Build Standardized Fundamental Data?

Chelsea Caltuna
June 24, 2020

The financial statements that publicly-traded companies file with the SEC can be a goldmine of information. And, like gold, it can take quite a bit of digging and refining to get real value out of those statements. You can spend weeks mapping out this data on your own (which you probably don’t have time for) or pay a firm with thousands of analysts who pore over the filings for you (which can be prohibitively expensive, especially for smaller companies).

Fortunately, there is a third option: machine learning. Intrinio’s fundamental data is powered by our proprietary machine learning technology, which helps us maintain a high level of quality while keeping costs low (which could mean five-figure savings for your business every year).

Read on to learn how we process fundamentals and what’s included in our fundamentals feed.

How Intrinio Processes Fundamentals

The SEC requires publicly-traded companies to file certain forms each year. Most investment firms (and other companies that make decisions using financial data) are interested in the 10-K and 10-Q filings, which represent annual and quarterly financial statements, respectively.

Companies file these forms in two formats. The first is an unstructured PDF that can really only be analyzed manually – great for full-time analysts, not so convenient for everyone else. The second is based on eXtensible Business Reporting Language, or XBRL. This format uses eXtensible Markup Language (XML) to help businesses report on their activities in a more consistent way.

XML can be, to put it lightly, difficult to work with. For example, if you look up the 2019 10-K for Apple, you’ll get something like this:

A bit of a headache, right? That’s why the SEC will usually parse this information out and show a cleaner version that’s easier to digest:

If you’re looking at one company, for one to three years at a time, the SEC interface is fine. If you want to go back more than three years, you’ll have to comb through multiple filings, but it’s doable. If you want to compare filings between multiple companies – well, that’s where it gets tricky.

Even using the XBRL taxonomy, no company files statements the same way. If you want to, say, compare Apple’s R&D expenses to Microsoft’s, you’ll quickly find that companies label these financials differently. That makes a direct comparison basically impossible.

To enable this comparison, Intrinio offers standardized financial statements. We parse out the information that is particularly relevant and useful to our customers and use propriety machine learning algorithms, such as multi-class classification, to map out the reported data against a standard set of terms that we use for all of the companies we cover. We’ve spent years training our algorithm with correct data to help it detect hidden patterns that are often not intuitive to humans.

Traditional data firms don’t use algorithms – they have teams of 100 to 1,000 people who map financial statements by hand. As we mentioned in the outset, this can be very expensive for you as the user. Our proprietary XBRL processing engine allows us to read these complex XBRL files at scale and extract relevant information. Any potential issues are flagged, then reviewed and fixed by a (human) team member. Our accuracy rate is roughly 99.8% before human review - comparable to a team of analysts, but without the steep data costs.

In addition to the standardized financials, we calculate more than a hundred investment metrics and performance ratios to add value to your analysis. All calculations are comparable across filings and companies. We also calculate several fiscal periods that are not reported. For example, companies are required to report Q1, Q2, Q3, and FY, but not Q4. We calculate the last quarter of the fiscal year to give you a more complete picture.

We have our own database of companies and SEC filings, so you can access as-reported financials at any time. When new filings are submitted to the SEC, that data is available to you, on average, within fifteen minutes.

Inside Our Fundamentals

Our fundamentals data includes standardized and as-reported 10-K and 10-Q statements for all US public companies. Get income statements, balance sheets, cash flow statements, and metrics such as market capitalization, EBITDA, and ROI. OTC securities, ETFs, and ADRs/foreign issuers are not currently covered. We offer history going as far back as the company has been filing in XBRL.

This dataset features a huge variety of supplementary data, including:

  • Hundreds of financial metrics & ratios
  • Company reference & metadata
  • Company news
  • Insider & institutional ownership
  • Institutional holdings
  • Upcoming & historical IPOs
  • Raw 10-Q, 10-K, 8-K, 20-F & 40-F text  
  • Searchable 10-Q & 10-K notes
  • Sector & industry rankings
  • Economic data

Testing and Accessing Fundamental Data

Intrinio’s fundamentals are available via API, bulk file downloads, direct database access, and FTP. Our tools are built by developers, for developers, and we strive to provide the most advanced data delivery platforms.

Here are a few other useful links:

API Documentation

API Endpoints

All Filings by Company

All Fundamentals by Company

Lookup Fundamental by Company

Software Development Kits







Data Tags

Ready to get started? Explore our equities data packages.

Sorry, we no longer support Internet Explorer as a web browser.

Please download one of these alternatives and return for the full Intrinio Experience.

Google Chrome web browser icon
Mozilla Firefox web browser icon
Safari web browser icon
Microsoft Edge web browser icon
Microsoft Edge