Skip to content

How robust is AI to altered data in finance?

We put four leading LLMs to the test—ChatGPT, Claude AI, Gemini, and aisot's Matterhorn—using a real NVIDIA news article.

LLM

The task all  four LLMs—ChatGPT, Claude AI, Gemini, and aisot's Matterhorn— received was simple: predict the price impact in percent.

First we used a  real NVIDIA news article.

Then we made subtle changes to the article—the kind someone might use to influence AI-driven analysis. What happened next reveals an important difference between general-purpose and specialized AI for financial markets.

Watch what happens when one number changes. When one false statement appears.

The results might surprise you. Stay tuned to the end of the video to understand why.

This is why specialization matters in finance.

Watch the full demonstration.