Scientific charts are information-rich, but most of the underlying data remain mostly inaccessible to machines, despite the rapid development of artificial intelligence and scientific databases 📊 Read how authors tackled this in a recent 'Behind The Paper' article 🔍
Communications Engineering has received its first Journal Impact Factor! 🏆 Thanks to all our authors who have trusted us with publishing their work, especially early in the journal's lifetime 📖
Over the years, researchers have proposed helicopters, fixed-wing aircraft, balloons, gliders, hoppers and hybrid concepts for exploring Mars. Which of these ideas actually make sense, and why?🚀 Read 'Behind The Paper' for a recent Comms Eng Perspective 🔍
Ghost noise in single-fiber bidirectional transmission links and its suppression approaches
LightPro: a linear photonic processor with full programmability
Recently launched collection! 🚗 In this collection, we bring together research communities at the intersection of risk and resilience engineering, transportation engineering, network science, and supply-chain management 🚛
Test-based equivalent-material method for collapse qualification of helically wound and layered cylindrical structures
What does Next-Gen cooling for power semiconductors in EVs look like?❄️ Henry A Martin and colleagues design a direct-to-package cooling design that aims to move liquid cooling closer to the semiconductor device - no need for thermal interface materials! Read more here: www.nature.com/articles/s44...
📢 @ai.cam.ac.uk's Local Gov #AI Accelerator pairs @cam.ac.uk researchers with local #councils to develop AI solutions that deliver tangible public value, from automating housing data collection to detecting fly-tipping using cameras on refuse vehicles
www.eng.cam.ac.uk/news/univers...
#flytipping
A PhD project on Martian aerial robotics became a deeper question: not only how to fly on Mars, but which aircraft concepts make sense, where, why, and under what planetary conditions.
Communications Engineering, Published online: 22 June 2026; doi:10.1038/s44172-026-00712-6Wujie Wang and colleagues report a hybrid modulation technique that suppresses ghost noise in single fiber bidirectional links. This extends low-noise sensing distances sixfold, enabling large-scale sensing via existing communications optical cables.
dlvr.it
Communications Engineering, Published online: 17 June 2026; doi:10.1038/s44172-026-00707-3Amin Shafiee and colleagues propose LightPro, a programmable photonic processor leveraging phase change materials. This framework improves the scalability and footprint of energy-efficient optical hardware for next generation AI.
This collection focuses on the intersection of risk and resilience engineering, transportation, network science, and supply chains, highlighting conceptual, ...
dlvr.it
Communications Engineering, Published online: 16 June 2026; doi:10.1038/s44172-026-00699-0Yuteng Zhang and colleagues propose a back-inferred equivalent material method for collapse qualification of helically wound and layered cylindrical structures. The framework infers a material law from radial compression test to predict critical collapse pressure without full-scale hydrostatic testing.
The inaugural Local Government AI Accelerator, launched by ai@cam – the University of Cambridge’s flagship mission on AI – is a programme that establishes a new model for how universities and local go...
Department of Engineering, University of Cambridge
Video
Scientific charts are one of the most information-rich components of modern research papers. Every year, millions of charts are published across disciplines ranging from materials science and chemistry to biology and medicine. These charts often contain valuable experimental measurements that cannot be found anywhere else in the article.
However, despite the rapid development of artificial intelligence and scientific databases, most of these data remain effectively inaccessible to machines. Researchers can read a chart and immediately understand the trends it presents, but extracting the underlying numerical values often requires tedious manual work. Existing tools typically depend on human interaction and become impractical when processing thousands of charts at scale.
Our motivation for developing ChartRecover originated from this challenge. As we worked on large-scale scientific data collection and AI-ready database construction, we repeatedly encountered valuable experimental results that existed only as images embedded in publications. We realized that unlocking these hidden data resources could significantly accelerate data-driven scientific discovery.
The Challenge We Did Not Expect
When we first set out to automate chart extraction, we assumed that identifying the numbers on the axes would be the easy part. We thought that once an optical character recognition (OCR) system read the tick labels, we could simply map those text boxes to their corresponding numerical values.
However, we quickly hit an unexpected roadblock. We found that even small visual offsets between the text labels and the actual tick marks produced massive errors in the recovered data coordinates. Because of variations in font sizes, line spacing, and general layout, the visual center of a text label rarely aligns perfectly with the physical tick mark. If we directly used the text's center point as our anchor, those tiny pixel deviations amplified into significant systematic mapping errors when calculating the final scientific values. This seemingly minor detail—the slight misalignment of text—became one of the biggest obstacles to achieving high-fidelity data extraction.
Building ChartRecover
To solve these issues, we developed ChartRecover(https://www.nature.com/articles/s44172-026-00691-8), an end-to-end framework designed to interpret charts much like a human researcher does, but at machine speed. We built the system around three core capabilities:
Element Detection: Instead of relying on rigid, pre-programmed templates, our system uses an object detection architecture to intuitively identify common chart components. It robustly detects axes, tick marks, legends, and the actual data points across a wide variety of visual styles and complex layouts.
Coordinate Recovery: To overcome the text deviation problem, we introduced a specialized algorithm that precisely associates the semantic meaning of the tick text with the exact physical pixel coordinates of the tick mark. By treating the physical tick mark as the true anchor, we successfully eliminated the impact of text rendering offsets, establishing a highly accurate mapping between the image pixels and the real-world numerical scales.
Adaptive Parsing: In the real world, charts are messy and diverse. Bar charts, for instance, can be horizontal or vertical, and their data can be independent or stacked. We engineered ChartRecover to systematically analyze the geometric overlap and boundaries of these structures. This allows the system to automatically distinguish between stacked and non-stacked data geometries, establishing accurate baselines regardless of the journal's unique plotting conventions.
From charts to Scientific Knowledge
By transforming static images into machine-readable numbers, ChartRecover opens entirely new possibilities for building large-scale, AI-ready scientific databases.
Historically, building databases for materials science, chemistry, or biomedical research required researchers to manually interact with extraction tools—a process completely unsuited for large-scale scenarios. Now, researchers can automatically recover absolute measurement data directly from visual plots. This structured data can immediately support advanced downstream applications, such as large-scale structure-property relationship mining. By making this data accessible, we can accelerate the screening of new catalysts, the discovery of novel battery materials, and the construction of comprehensive scientific knowledge graphs.
Looking Ahead
More broadly, we hope this work contributes to a future in which scientific charts become as searchable and reusable as scientific text. Unlocking the vast amount of empirical evidence currently trapped inside published charts could significantly expand the resources available for AI-driven scientific discovery.
As the shift toward automated, high-quality data accumulation continues, tools like ChartRecover will be at the forefront of facilitating the broader reuse of global scientific data. Ultimately, by making the data behind the charts accessible to everyone, we can enhance research transparency, improve automated scientific verification, and ensure that no valuable experimental result is left behind simply because it was published as an image.