Artificial Intelligence

MIT and IBM Develop Dataset to Boost AI Chart Interpretation

Researchers from MIT and the MIT-IBM Computing Research Lab have developed ChartNet, a large-scale synthetic dataset designed to train vision-language models (VLMs) to interpret charts more effectively. ChartNet enables smaller, open-source AI models to outperform much larger commercial systems in tasks such as extracting data, summarizing charts, and answering questions related to chart visuals.

What happened

To address the challenge of accurately interpreting multimodal chart data, the research team created a dataset containing over one million diverse synthetic chart images. Each chart in ChartNet is accompanied by the code used to generate it, detailed textual descriptions, numerical tables, and question-and-answer pairs to train AI models comprehensively.

The dataset was generated through a two-step automated pipeline: first converting existing charts into code, then augmenting them by varying chart types, data values, colors, and topics. This approach produced a broad range of chart examples while maintaining high-quality visual and numerical coherence, verified via automated quality control. Additionally, a smaller subset of charts was manually annotated by experts to support fine-tuning and ensure data validity.

Using ChartNet, the team trained IBM’s Granite Vision series and additional open-source models. In evaluations across multiple tasks—such as chart reconstruction, data extraction, summarization, and question answering—the models trained on ChartNet showed significant accuracy improvements, in some cases surpassing commercial counterparts.

Why it matters

Charts are integral to many sectors, including finance and scientific research, where understanding trends and data quickly is essential. However, existing AI models often struggle to integrate visual, numerical, and linguistic elements simultaneously, which limits their effectiveness in interpreting complex charts.

By enabling smaller, open-source AI models to outperform larger, proprietary ones, ChartNet lowers barriers for smaller organizations and researchers to access high-performance chart analysis. This advances AI’s practical utility in business intelligence, scientific data interpretation, and other fields that rely on extracting actionable insights from visual data representations.

Background

While recent advances in generative AI have significantly improved natural language processing and image recognition, limited progress has been made in accurately interpreting charts. Many existing datasets contain relatively few chart images and lack the detailed multimodal information required for robust AI understanding.

Synthetic data generation has emerged as a solution to such dataset bottlenecks. By artificially producing large, labeled datasets that mimic real-world data properties, researchers can train AI models more effectively without relying on scarce manually labeled data. ChartNet represents a substantial advancement in this approach by offering a comprehensive resource tailored specifically for chart understanding tasks.

Sources

This article is based on reporting and publicly available information from the following source:

Read more Artificial Intelligence stories on Goka World News.

Giorgio Kajaia
About the author

Giorgio Kajaia

Giorgio Kajaia writes and publishes news coverage for Goka World News, focusing on technology, business, science, health, space, and major global developments. His work is centered on clear reporting, concise context, and reader-friendly explanations based on publicly available information.

View all posts by Giorgio Kajaia