The content on this page was provided by an independent third party and syndicated by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

First Benchmark for Legacy Code Comprehension Shows Specialized AI Approach Outperforms General-PurposeModels

LegacyCodeBench tests whether AI can understand COBOL well enough to document itaccurately not just generate plausible text

NEW YORK, NY, UNITED STATES, January 13, 2026 /EINPresswire.com/ — A new benchmark designed to measure whether AI systems can actuallyunderstand legacy enterprise code shows that specialized approaches significantlyoutperform general-purpose models. LegacyCodeBench, developed by Kalmantic (anapplied AI research lab) in collaboration with Hexaview Technologies, evaluates AIcomprehension of COBOL the language still processing 95% of ATM transactions and $3trillion in daily global transactions.
The benchmark finds that domain-specialized systems like Hexaview’s Legacy Insightsachieve 92% accuracy, compared to 86-90% for general-purpose models like GPT-4o andClaude Sonnet 4.

-Why This Matters
Over 220 billion lines of COBOL remain in production worldwide, but the engineers whowrote it are retiring. Modernization projects fail at rates exceeding 60%, and the pattern isusually the same: organizations try to replace systems they never fully understood.

“The risk everyone focuses on is the legacy technology itself, but that’s not actually whereprojects fall apart,” said Ankit Agarwal, Founder and CTO of Hexaview. “What kills these programs is undocumented business logic. We needed an objective way to measurewhether AI can actually understand these systems well enough to trust the output.”


-How It Works
Most AI benchmarks use another LLM to judge output quality, which creates reproducibilityproblems. LegacyCodeBench takes a different approach: it verifies claims against theoriginal program’s behavior.The process extracts specific behavioral claims from AI-generated documentation -statements like “PREMIUM is calculated by multiplying BASE-RATE by RISK-FACTOR” – andthen verifies them by executing the original COBOL program with test inputs. If the claimdoesn’t match what the code actually does, it fails.”We’re not testing whether documentation reads well,” said Nikita, co-author of the paper.”We wanted to know if you could actually trust it. There’s a difference.”The benchmark also penalizes gaming. Documentation that avoids making testable claimsscores zero on the behavioral track, which carries 50% of the total weight. And if the AIhallucinates variables that don’t exist in the source code, the entire task fails

-Results


| System | LCB Score | Structural | Doc Quality | Behavioral | T1 Basic | T4 Enterprise |
| ————————— | ——— | ———- | ———– | ———- | ——– | ————- |
| Legacy Insights (Hexaview) | 92% | 94% | 96% | 90% | 96% | 90% |
| Claude Sonnet 4 (Anthropic) | 90% | 96% | 78% | 91% | 92% | 92% |
| AWS Transform Mainframe | 88% | 98% | 68% | 91% | 88% | 87% |
| IBM Granite 13B | 87% | 93% | 72% | 90% | 89% | 84% |
| GPT-4o (OpenAI) | 86% | 92% | 71% | 89% | 91% | 82% |


Specialized systems (Legacy Insights, AWS Transform) outperform general-purposemodels, particularly on documentation quality. All models maintain reasonably strongperformance from basic programs (T1) to enterprise-scale COBOL (T4), though GPT-4oshows the largest drop (9 points).

“General-purpose models have gotten quite good at parsing legacy code, which is realprogress,” Agarwal said. “But there’s still a gap between understanding the syntax andunderstanding what the code is actually doing in a business context. That’s wherespecialization matters.”

-Open Source
LegacyCodeBench is fully open source with deterministic evaluation. The publicleaderboard is at legacycodebench.com, and the team welcomes submissions via GitHub

-Resources
• Website: legacycodebench.com
• Paper: Available at legacycodebench.com
• GitHub: github.com/kalmantic/legacycodebench
• Legacy Insights: legacyip.hexaview.ai


-About Hexaview
Hexaview is a strategic implementation partner for regulated enterprises, specializing inlegacy system preservation and modernization. Learn more: hexaviewtech.com

-About Kalmantic Labs Kalmantic is an applied AI research lab studying the challenges that emerge when AI meetsproduction systems. They publish research openly and build tools based on their findings.Learn more: kalmantic.com

LegacyCodeBench is open source under MIT license.

Ankit Agarwal
Hexaview Technologies
+1 845-653-3855
email us here

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

Aline and ECP Launch New Integration to Simplify Onboarding and Improve Sales-to-Care Handoffs

Aline and ECP Launch New Integration to Simplify Onboarding and Improve Sales-to-Care Handoffs

Senior living operators gain faster move-ins, fewer errors, and shared visibility across teams LOUISVILLE, KY, UNITED

January 21, 2026

CSU College of Law to Offer Students AltaClaro’s Certificate in ‘Fundamentals of Prompt Engineering for Lawyers’

CSU College of Law to Offer Students AltaClaro’s Certificate in ‘Fundamentals of Prompt Engineering for Lawyers’

Program expands CSU|LAW’s broader AI strategy—including a new AI Advisory Council—and builds on the school’s national

January 21, 2026

Case Study Identifies Echo Penalty™ as a Plasma Systems Output Limitation and Introduces a New Analytical Construct

Case Study Identifies Echo Penalty™ as a Plasma Systems Output Limitation and Introduces a New Analytical Construct

Findings challenge prevailing ignition–sustainment assumptions, reframing plasma instability as a systems-level

January 21, 2026

ATX Construction Handyman & Remodeling Sets a New Standard for Home Improvement and Customer Experience in Austin

ATX Construction Handyman & Remodeling Sets a New Standard for Home Improvement and Customer Experience in Austin

Austin-based ATX Construction Handyman & Remodeling LLC rolls out a customer-first home services approach with

January 21, 2026

Author Dr. Susan Krup Grunin of SKG Creations Recently Featured on Close Up Radio

Author Dr. Susan Krup Grunin of SKG Creations Recently Featured on Close Up Radio

NAPLES, FL, UNITED STATES, January 14, 2026 /EINPresswire.com/ — Dr. Susan Krup Grunin, PhD, has worn many hats:

January 21, 2026

Mytsv.com Research Reveals Alarming ‘Biological Resilience Gap’: Why the Pre-1940 Generation Outperforms Modern Youth

Mytsv.com Research Reveals Alarming ‘Biological Resilience Gap’: Why the Pre-1940 Generation Outperforms Modern Youth

The Biological Resilience Gap: A Comparative Analysis of Generational Strength, Longevity, and Mortality DEERFIELD, IL,

January 21, 2026

Dept. of Education Issues Guidance on Responsible AI Use in Schools – Including Accessibility and Privacy Principles

Dept. of Education Issues Guidance on Responsible AI Use in Schools – Including Accessibility and Privacy Principles

ED guidance urges responsible AI use in schools, prioritizing accessibility, transparency, privacy, and educator-led

January 21, 2026

Trikke to Showcase New Patrol Vehicle and Law Enforcement Accessories at SHOT Show 2026

Trikke to Showcase New Patrol Vehicle and Law Enforcement Accessories at SHOT Show 2026

We’re unveiling Trikke’s 2026 Version 3 Positron Police Spec vehicle, featuring a substantially reinforced frame and

January 21, 2026

Lil Mama’s Sweets and Treats Earns 2025 Best of Georgia Award for Scratch-Made Excellence

Lil Mama’s Sweets and Treats Earns 2025 Best of Georgia Award for Scratch-Made Excellence

AUGUSTA, GA, UNITED STATES, January 14, 2026 /EINPresswire.com/ — Some businesses satisfy cravings. Others quietly

January 21, 2026

EXTRACT ADVISORS: How a New Force in High-Speed Expansion is Changing the Investment Industry Landscape

EXTRACT ADVISORS: How a New Force in High-Speed Expansion is Changing the Investment Industry Landscape

Strategic Vision and Tech-Driven Discretionary Model Position EXTRACT as a New Force in the Global Investment Industry

January 21, 2026

Parrish Law Firm Announces 2026 Community Programs Supporting Education and Youth Engagement Across Northern Virginia

Parrish Law Firm Announces 2026 Community Programs Supporting Education and Youth Engagement Across Northern Virginia

New school-focused initiative joins returning scholarship, educator support, and youth wellness programs Education,

January 21, 2026

Membrion Named Part of 2026 Global Cleantech 100

Membrion Named Part of 2026 Global Cleantech 100

Recognition highlights companies championing resource security and economic durability Industrial operators are under

January 21, 2026

Healthspan Collective & Regen Therapy Announce Partnership to Advance Next-Gen Regenerative Medicine Education & Access

Healthspan Collective & Regen Therapy Announce Partnership to Advance Next-Gen Regenerative Medicine Education & Access

The partnership aims to provide curated access to credible science, responsible innovation, and practical frameworks

January 21, 2026

The Club at Mediterra earns Elite status from Distinguished Clubs

The Club at Mediterra earns Elite status from Distinguished Clubs

National recognition places the club among just 132 private clubs nationwide for exceptional service, amenities and

January 21, 2026

RISE Healthy Communities Summit Returns with Expanded Whole Person Health Mission

RISE Healthy Communities Summit Returns with Expanded Whole Person Health Mission

ORLANDO, FL, UNITED STATES, January 14, 2026 /EINPresswire.com/ — The RISE Healthy Communities Summit, formerly the

January 21, 2026

East West Partners And Sonnenalp Hotel Create Exclusive Partnership To Offer Prima Residences At The Sonnenalp

East West Partners And Sonnenalp Hotel Create Exclusive Partnership To Offer Prima Residences At The Sonnenalp

Four Luxury Homes in the Heart of Vail Village Combine Mountain Craftsmanship with the Legendary Service and Amenities

January 21, 2026

IdeaLift Accepted Into Microsoft Partner Network

IdeaLift Accepted Into Microsoft Partner Network

Partnership enables deeper Microsoft Teams integration and Azure Marketplace availability for product teams worldwide

January 21, 2026

Grant Brothers Tree Service Helps Form New National Tree Care Association

Grant Brothers Tree Service Helps Form New National Tree Care Association

Virginia-based tree care company plays a leadership role in forming a new national association focused on safety,

January 21, 2026

RETSY Ranks Among Arizona’s Top 10 Residential Real Estate Brokerages; 10 Agents and Teams in Phoenix’s Most Productive

RETSY Ranks Among Arizona’s Top 10 Residential Real Estate Brokerages; 10 Agents and Teams in Phoenix’s Most Productive

Phoenix Business Journal Rankings Underscore RETSY's Leadership in Sales Volume, Agent Productivity, and Luxury Market

January 21, 2026

Gross-Wen Technologies Named on the 2026 Global Cleantech 100

Gross-Wen Technologies Named on the 2026 Global Cleantech 100

A Year Defined by Intensifying Competition, Resource Security, and the Rise of Economic Durability as Cleantech’s New

January 21, 2026

Crow’s Nest Campground Opens 2026 Season Reservations with Enhanced Family Amenities

Crow’s Nest Campground Opens 2026 Season Reservations with Enhanced Family Amenities

Newport, NH destination campground announces early booking for Mount Sunapee region getaways featuring upgraded

January 21, 2026

Law Office of Justin C. Frankel, P.C. Successfully Reinstates Disability Benefits for Senior Executive

Law Office of Justin C. Frankel, P.C. Successfully Reinstates Disability Benefits for Senior Executive

GARDEN CITY, NY, UNITED STATES, January 14, 2026 /EINPresswire.com/ — Disability Benefits Reinstated for Senior

January 21, 2026

Insurance Expert Bill Pancake of Kissimmee, FL Discusses Auto Insurance Coverage in HelloNation

Insurance Expert Bill Pancake of Kissimmee, FL Discusses Auto Insurance Coverage in HelloNation

How much auto insurance is enough in Central Florida? KISSIMMEE, FL, UNITED STATES, January 14, 2026 /EINPresswire.com/

January 21, 2026

AstroDoc Announces ASTRID – Healthcare AI That Solves the ‘Last Mile’

AstroDoc Announces ASTRID – Healthcare AI That Solves the ‘Last Mile’

Healthtech company with integrated U.S. medical practice offers free global AI access and seamless care delivery –

January 21, 2026

Golden Waves Grain Announces Strategic Investment from Foote Family, Advancing $200m Goodland KS Milling/Bakery Project

Golden Waves Grain Announces Strategic Investment from Foote Family, Advancing $200m Goodland KS Milling/Bakery Project

Golden Waves Grain announced the major investment. The $200M project breaks ground Spring '26. Projects like this help

January 21, 2026

Rowan Foundation Launches National Writing Scholarships For Undergraduate Women

Rowan Foundation Launches National Writing Scholarships For Undergraduate Women

Program Includes a First-of-Its-Kind National Writing Scholarship for Women Affected by Blood Clots and Clotting

January 21, 2026

Advanced Axis Delivers Industry-Leading Results in New AT&T Partnership

Advanced Axis Delivers Industry-Leading Results in New AT&T Partnership

Advanced Axis delivers measurable AT&T growth, generating 5,200 new customers and $18.2M in revenue through

January 21, 2026

LinkedIn Automation Update Helps Sales Teams Scale Outreach Without Losing Message Quality

LinkedIn Automation Update Helps Sales Teams Scale Outreach Without Losing Message Quality

NEW YORK, NY, UNITED STATES, January 14, 2026 /EINPresswire.com/ — As buyer expectations continue to rise, many sales

January 21, 2026

Revelation Biosciences Inc. to Present at The International Conference on Advances in Critical Care Nephrology (AKI & CRRT 2026)

Revelation Biosciences Inc. to Present at The International Conference on Advances in Critical Care Nephrology (AKI & CRRT 2026)

– Presentation to Include Additional Positive Data from the Recently Completed PRIME Clinical Study – SAN DIEGO, CA /

January 21, 2026

Advanced Health Selects 1upHealth to Lead Interoperability Initiatives

Advanced Health Selects 1upHealth to Lead Interoperability Initiatives

Oregon Coordinated Care Organization to Leverage 1upHealth's Comprehensive Interoperability Suite to Drive CMS

January 21, 2026

Todd Buchanan Named President of AmeriLife Wealth

Todd Buchanan Named President of AmeriLife Wealth

Industry veteran brings nearly three decades of experience to lead AmeriLife's expanding Wealth Distribution platform

January 21, 2026

Datavault AI Announces it has Developed Patented AI Rating Technology Launching Globally with Fintech.TV in Pilot Season

Datavault AI Announces it has Developed Patented AI Rating Technology Launching Globally with Fintech.TV in Pilot Season

Introducing AI Content Detection, Real-Time Bias Meter and Breakthrough Interactive Polling Powered by ADIO®

January 21, 2026

Pacific Avenue Capital Partners to Acquire U.S. Power Chain Hoist and Chain Business from Columbus McKinnon

Pacific Avenue Capital Partners to Acquire U.S. Power Chain Hoist and Chain Business from Columbus McKinnon

LOS ANGELES, CA / ACCESS Newswire / January 14, 2026 / Pacific Avenue Capital Partners ("Pacific Avenue"), a Los

January 21, 2026

Creative Fabrica Enters the Third Dimension: New AI Tools Turn Text into 3D Print Models Instantly

Creative Fabrica Enters the Third Dimension: New AI Tools Turn Text into 3D Print Models Instantly

Create 3D models from text in seconds. Creative Fabrica’s new AI tools let you generate, validate, and export

January 21, 2026

​​Let Grow Announces Strategic Expansion: Nonprofit Nearly Triples Staff to Advance Childhood Independence Movement

​​Let Grow Announces Strategic Expansion: Nonprofit Nearly Triples Staff to Advance Childhood Independence Movement

Responding to rising demand from schools and families, Let Grow nearly triples its team to expand evidence-based

January 21, 2026

TX Supreme Court Reasserts Authority Over Law-School Approval for Bar Admission, Ending Automatic Reliance on ABA

TX Supreme Court Reasserts Authority Over Law-School Approval for Bar Admission, Ending Automatic Reliance on ABA

Landmark rule change follows years of public debate—and highlights the real-world impact of attorney Nelson A. Locke’s

January 21, 2026

Ideal Physical Therapy Helps Golfers Address Common Injuries and Improve Performance

Ideal Physical Therapy Helps Golfers Address Common Injuries and Improve Performance

One-on-one physical therapy led by Dr. James Harris, PT, DPT, helping golfers reduce pain, improve movement, and stay

January 21, 2026

70-Year-Old Historian Releases First Video Game After 36 Years in Educational Software Development

70-Year-Old Historian Releases First Video Game After 36 Years in Educational Software Development

History Run brings American history to life through fast-paced gameplay As player attempt to restore artifacts to the

January 21, 2026

New York Comedy Film Festival Announces Full Schedule for Inaugural Weeklong Celebration of Comedy Film February 15 – 22

New York Comedy Film Festival Announces Full Schedule for Inaugural Weeklong Celebration of Comedy Film February 15 – 22

NYC’s first festival dedicated exclusively to comedy presents 75+ features, shorts, episodics, and docs, plus filmmaker

January 21, 2026

Sustaira Enhances Sustainability Teams Productivity And Ends the ‘One-Size-Fits-None’ Era of Sustainability Solutions

Sustaira Enhances Sustainability Teams Productivity And Ends the ‘One-Size-Fits-None’ Era of Sustainability Solutions

Modular, AI-Powered Solutions Accelerate Decarbonization, Risk Management, and Operational Sustainability Across

January 21, 2026