Apr 13, 2026
LitXBench: A Benchmark for Extracting Experiments from Materials Literature
We built a benchmark to measure how well LLMs can extract experiments from materials science papers, and found that most pipelines task LLMs incorrectly, leading to inaccurate extractions.