Empowering informed decisions on investment strategy
When training specific machine-learning models and backtesting quantitative strategies, a foundation of clean data is vital to ensure that results are accurate and new strategies or models are effective.
Any backtests or machine-learning models based on standard datasets will be inaccurate. This is because the figures available today are different to those available in the past.
Most backtests are currently run against standard datasets, whether fundamental or economic, that do not tell the full story behind the figures. These datasets include the latest reported values for any datapoint, but do not show that those values may have been revised and restated several times, for example due to divestitures, mergers and acquisitions or accounting changes. These revisions are common across all industries and can be made weeks, months or even years after the original filing, to significant effect.
In standard datasets, these restated figures will overwrite historical results, meaning that your backtest can only take into account the current value and not the information that was actually available at the time. This has the potential to seriously impact the quality of your research, and the reliability of your results.
What is portfolio backtesting?
Portfolio backtesting is the process of simulating an investment strategy using historical prices to test how well the strategy would have done in the past. Running a simulation over a large number of stocks for past decades is a computationally intensive process. But if you are testing and building models with standard datasets, you are using inaccurate data that may render the results of those tests less reliable – jeopardizing your entire process.
Look ahead bias occurs when research or a simulation relies on data or information that was not yet available or known during the time period being studied. You may be applying lag assumptions to mitigate these risks – however these lags are also imprecise. Complicated by filings regulations that change over time and differ between countries and companies, they are often an ineffective solution to the problem.
Transform the way you backtest quantitative strategies
As the only dataset to include all reported values, timestamped to the moment that they were made available to the market, Refinitiv Point-in-Time data is unique in showing you exactly what information was available and when. This allows you to backtest against real conditions – removing the need for inaccurate lag assumptions, and allowing for more accurate assignment to groupings. This ultimately provides more accurate results, and a truer analysis of how your portfolio or strategy would have performed.
With two Point-in-Time fundamentals databases, and the market’s only Point-in-Time estimates data, we offer the industry’s most comprehensive offering of Point-in-Time data. Every datapoint is regularly updated, and has the high standard of accuracy you expect from Refinitiv.
|Point In time data||Non Point In Time data|
Has all figures, including preliminary results, finalized results and restatements.
Includes only the most recently available figures, overwriting previous results.
|Timestamped to the exact date and time the information was made available.||A lag period must be assumed relative to the filing period end date.|
|Includes data on funds and entities that existed at the time.||Includes data only on funds and entities that exist today.|
What our data covers
- Refinitiv Worldscope Fundamentals: Coverage of almost 90,000 companies, dating back to 1983, timestamped to the day and updated daily.
- Refinitiv Financials: Coverage of over 100,000 companies, dating back to 1980, timestamped to the day and updated daily.
- Refinitiv I/B/E/S: The only Point-in-Time Estimates data available in the industry. Coverage of over 80,000 companies, dating back to 1980, updated weekly with daily timestamping. Economics Point-in-Time Over 2,100 key indicator economic series, with up to 20 years of history.
- Economics: Coverage of 52 countries, dating back to 2010, time stamped to the day and updated on an intraday basis
- Reuters Real-time News: Coverage of over 90,000 companies, dating back to 1996, timestamped to the millisecond and updated in real-time.
- Refinitiv MarketPsych ESG Analytics: More than 20,000 active companies from over 80 countries, with history dating back to 1998.
- The Refinitiv Business Classifications: Covering over 250,000 securities in 130 countries to 5 levels of granularity, The Refinitiv Business Classifications (TRBC) is the most comprehensive, detailed, and up-to-date sector and industry classification available
- Refinitiv MarketPsych Analytics: coverage of 16,000+ companies, 36 commodities and energy subjects, 187 countries, 62 sovereign markets, 45 currencies, 150+ cryptocurrencies, dating back to 2011
Begin portfolio backtesting with confidence
Biases are inherent in standard data, point-in-time data helps eliminate survivorship and look-ahead bias.
Use historic data that is consistent and never overwritten to ensure the results of your models are accurate.
Datasets include preliminary, original, re-statements, reclassifications, and final figures, as far back as 1980.
If we are not using point-in-time data, we could end up with a backtest that is not properly representative of a strategy’s historical performance. Any subsequent trading decision based on such a backtest could be made as a result of inaccurate data. We could end up investing in a strategy that is not as attractive as our statistical analysis suggests, because of hindsight bias induced by using data which isn’t point-in-time.
We're here for you
Have questions? We're here to help. Talk to a real person and get the answers that matter most.