This page contains press release content distributed by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

First Benchmark for Legacy Code Comprehension Shows Specialized AI Approach Outperforms General-PurposeModels

LegacyCodeBench tests whether AI can understand COBOL well enough to document itaccurately not just generate plausible text

NEW YORK, NY, UNITED STATES, January 13, 2026 /EINPresswire.com/ — A new benchmark designed to measure whether AI systems can actuallyunderstand legacy enterprise code shows that specialized approaches significantlyoutperform general-purpose models. LegacyCodeBench, developed by Kalmantic (anapplied AI research lab) in collaboration with Hexaview Technologies, evaluates AIcomprehension of COBOL the language still processing 95% of ATM transactions and $3trillion in daily global transactions.
The benchmark finds that domain-specialized systems like Hexaview’s Legacy Insightsachieve 92% accuracy, compared to 86-90% for general-purpose models like GPT-4o andClaude Sonnet 4.

-Why This Matters
Over 220 billion lines of COBOL remain in production worldwide, but the engineers whowrote it are retiring. Modernization projects fail at rates exceeding 60%, and the pattern isusually the same: organizations try to replace systems they never fully understood.

“The risk everyone focuses on is the legacy technology itself, but that’s not actually whereprojects fall apart,” said Ankit Agarwal, Founder and CTO of Hexaview. “What kills these programs is undocumented business logic. We needed an objective way to measurewhether AI can actually understand these systems well enough to trust the output.”


-How It Works
Most AI benchmarks use another LLM to judge output quality, which creates reproducibilityproblems. LegacyCodeBench takes a different approach: it verifies claims against theoriginal program’s behavior.The process extracts specific behavioral claims from AI-generated documentation -statements like “PREMIUM is calculated by multiplying BASE-RATE by RISK-FACTOR” – andthen verifies them by executing the original COBOL program with test inputs. If the claimdoesn’t match what the code actually does, it fails.”We’re not testing whether documentation reads well,” said Nikita, co-author of the paper.”We wanted to know if you could actually trust it. There’s a difference.”The benchmark also penalizes gaming. Documentation that avoids making testable claimsscores zero on the behavioral track, which carries 50% of the total weight. And if the AIhallucinates variables that don’t exist in the source code, the entire task fails

-Results


| System | LCB Score | Structural | Doc Quality | Behavioral | T1 Basic | T4 Enterprise |
| ————————— | ——— | ———- | ———– | ———- | ——– | ————- |
| Legacy Insights (Hexaview) | 92% | 94% | 96% | 90% | 96% | 90% |
| Claude Sonnet 4 (Anthropic) | 90% | 96% | 78% | 91% | 92% | 92% |
| AWS Transform Mainframe | 88% | 98% | 68% | 91% | 88% | 87% |
| IBM Granite 13B | 87% | 93% | 72% | 90% | 89% | 84% |
| GPT-4o (OpenAI) | 86% | 92% | 71% | 89% | 91% | 82% |


Specialized systems (Legacy Insights, AWS Transform) outperform general-purposemodels, particularly on documentation quality. All models maintain reasonably strongperformance from basic programs (T1) to enterprise-scale COBOL (T4), though GPT-4oshows the largest drop (9 points).

“General-purpose models have gotten quite good at parsing legacy code, which is realprogress,” Agarwal said. “But there’s still a gap between understanding the syntax andunderstanding what the code is actually doing in a business context. That’s wherespecialization matters.”

-Open Source
LegacyCodeBench is fully open source with deterministic evaluation. The publicleaderboard is at legacycodebench.com, and the team welcomes submissions via GitHub

-Resources
• Website: legacycodebench.com
• Paper: Available at legacycodebench.com
• GitHub: github.com/kalmantic/legacycodebench
• Legacy Insights: legacyip.hexaview.ai


-About Hexaview
Hexaview is a strategic implementation partner for regulated enterprises, specializing inlegacy system preservation and modernization. Learn more: hexaviewtech.com

-About Kalmantic Labs Kalmantic is an applied AI research lab studying the challenges that emerge when AI meetsproduction systems. They publish research openly and build tools based on their findings.Learn more: kalmantic.com

LegacyCodeBench is open source under MIT license.

Ankit Agarwal
Hexaview Technologies
+1 845-653-3855
email us here

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

Deloitte Taiwan and C2A Security Partner to Accelerate Innovation in Product Security and Risk Management

Deloitte Taiwan and C2A Security Partner to Accelerate Innovation in Product Security and Risk Management

Cybersecurity for industrial, semiconductor, and medical device cyber-physical products, focused on Taiwan compliance and global export markets C2A and Deloitte are enabling Taiwanese companies to…

January 13, 2026

Jeskell Systems and Cobalt Iron Host Federal Webinar on Zero Access Backup and AI-Driven Resilience

Jeskell Systems and Cobalt Iron Host Federal Webinar on Zero Access Backup and AI-Driven Resilience

This session highlights Zero Access backup architecture and AI-driven automation designed to improve resilience and recovery confidence for federal IT teams. As cyber threats evolve,…

January 13, 2026

Fulflex Sets New Bar for Sustainability and Manufacturing Excellence

Fulflex Sets New Bar for Sustainability and Manufacturing Excellence

VT, UNITED STATES, January 8, 2026 /EINPresswire.com/ — Fulflex, a global leader in thin-gauge calendering and polymer innovation, continues its upward trajectory in late 2025….

January 13, 2026

Chicago-based Restoration Company Announces New Niles Office Location

Chicago-based Restoration Company Announces New Niles Office Location

ServiceMaster Restoration by Zaba expands operations with a new Niles office and warehouse to enhance response times

January 13, 2026

Loa Carbon Appoints Ambassador Robert S. Gelbard as Chair of Political Advisory Board

Loa Carbon Appoints Ambassador Robert S. Gelbard as Chair of Political Advisory Board

Veteran statesman and senior diplomat brings four decades of global policy leadership to support Loa Carbon’s expansion

January 13, 2026

Every Mother Counts Announces U.S. Maternal Health Press Fellowship: Applications Open Jan 12-Feb 23

Every Mother Counts Announces U.S. Maternal Health Press Fellowship: Applications Open Jan 12-Feb 23

The fellowship is a learning opportunity that will help journalists understand the challenges and solutions shaping

January 13, 2026

Commio to Exhibit at ITEXPO Florida 2026

Commio to Exhibit at ITEXPO Florida 2026

Telecom provider to showcase RCS messaging capabilities and Branded Calling ID™ February 10-12 at the Fort

January 13, 2026

Ruadán Books and Temple Dark Books Form Publishing Cooperative

Ruadán Books and Temple Dark Books Form Publishing Cooperative

Ruadán Books and Temple Dark Books come together to form a Dark Fiction Independent Publishing Cooperative By putting

January 13, 2026

Genesis Exotic Transport Details Industry-Standard Securement Standards for Enclosed Luxury and Exotic Vehicle Transport

Genesis Exotic Transport Details Industry-Standard Securement Standards for Enclosed Luxury and Exotic Vehicle Transport

Genesis Exotic Transport Releases Industry-Standard Protocols for Enclosed Vehicle Shipping TAMPA, FL, UNITED STATES,

January 13, 2026

Gaming Intelligence Platform Reveals Innovation 18-36 Months Early

Gaming Intelligence Platform Reveals Innovation 18-36 Months Early

FutureOfGaming.com – Strategic intelligence service revealing where Sony, Microsoft, EA are investing and what

January 13, 2026

Texas Slab Guys Launches AI-Powered, First-Ever Self-Serve Concrete Leveling Quote Tool for Homeowners

Texas Slab Guys Launches AI-Powered, First-Ever Self-Serve Concrete Leveling Quote Tool for Homeowners

Industry trailblazers Texas Slab Guys unveil LevelEstimate™, an AI-powered concrete leveling quote tool setting a new

January 13, 2026

RW3 CultureWizard Announces 2026 Predictions Webinar on Leading Through Uncertainty

RW3 CultureWizard Announces 2026 Predictions Webinar on Leading Through Uncertainty

Webinar will explore how culture becomes a competitive advantage amid AI acceleration, workforce shifts, and global

January 13, 2026

SMX Applies Molecular Tracking Technology to Silver Supply Chains

SMX Applies Molecular Tracking Technology to Silver Supply Chains

NEW YORK, NY / ACCESS Newswire / January 13, 2026 / SMX (NASDAQ:SMX)(NASDAQ:SMXWW), a leader in material-embedded

January 13, 2026

MyLegalWin Launches 2026 Awards, Marking Third Year of Attorney and Law Firm Recognition

MyLegalWin Launches 2026 Awards, Marking Third Year of Attorney and Law Firm Recognition

MyLegalWin launches its 2026 awards, marking three years of attorney and law firm recognition with expanded city-based

January 13, 2026

Louisiana Impact Fund launches to keep companies, jobs & wealth in Louisiana

Louisiana Impact Fund launches to keep companies, jobs & wealth in Louisiana

Targets $100 million to back Louisiana-based businesses and preserve in-state ownership LAFAYETTE, LA, UNITED STATES,

January 13, 2026

CodaPet launches compassionate in-home pet euthanasia services in Akron, OH, and surrounding areas.

CodaPet launches compassionate in-home pet euthanasia services in Akron, OH, and surrounding areas.

The veterinarian-owned startup empowers a network of veterinarians who provide in-home euthanasia to ease the passing

January 13, 2026

Sphera Partners With Rolls-Royce Power Systems on mtu Backup Power Gensets Environmental Product Declarations

Sphera Partners With Rolls-Royce Power Systems on mtu Backup Power Gensets Environmental Product Declarations

Collaboration delivers verified EPDs to support procurement and regulatory reporting The published EPDs are testament

January 13, 2026

DOGTV Launches Free, Ad-Supported Content on Two Channels

DOGTV Launches Free, Ad-Supported Content on Two Channels

DOGTV channel designed for dogs and Unleashed by DOGTV for pet parents now available free with pre-roll ads or unlock

January 13, 2026

Innovation Driving Housing Solutions: New Podcast Season Explores How Innovation is Tackling Canada’s Housing Crisis

Innovation Driving Housing Solutions: New Podcast Season Explores How Innovation is Tackling Canada’s Housing Crisis

Featuring BC Minister of Housing and Municipal Affairs, Christine Boyle VANCOUVER, BRITISH COLUMBIA, CANADA, January

January 13, 2026

Ignition expands executive leadership to accelerate its next chapter of global growth

Ignition expands executive leadership to accelerate its next chapter of global growth

Former Totango growth leader Anne Ting named Chief Marketing Officer; payments executive Greg Hatcher joins Ignition as

January 13, 2026

Rimes Announces Strategic Partnerships with PANTA, BMLL, and Ortec Finance to Strengthen Data and Analytics Capabilities

Rimes Announces Strategic Partnerships with PANTA, BMLL, and Ortec Finance to Strengthen Data and Analytics Capabilities

Partnerships accelerate client access to interoperable data ecosystem with powerful analytics, deeper insights and

January 13, 2026

Algo Acquires Demand Driven Technologies, Makers of Intuiflow, to Create a Unified Demand-to-Supply Planning Platform

Algo Acquires Demand Driven Technologies, Makers of Intuiflow, to Create a Unified Demand-to-Supply Planning Platform

New company will deliver a unified platform that connects supply chain planning and execution to operational reality

January 13, 2026

Fasoo Highlights Critical Need for an AI Governance Platform to Secure Semiconductor Innovation

Fasoo Highlights Critical Need for an AI Governance Platform to Secure Semiconductor Innovation

BETHESDA, MD, UNITED STATES, January 13, 2026 /EINPresswire.com/ — Fasoo, the leader in data-centric security,

January 13, 2026

American Marketing Association Releases Groundbreaking Future Trends in Marketing Report

American Marketing Association Releases Groundbreaking Future Trends in Marketing Report

Report explores the dynamic landscape shaped by shifting trends, emerging technologies, and evolving consumer

January 13, 2026

Future-Ready Communication: LISTSERV® Powers Vision and Renewal for 2026

Future-Ready Communication: LISTSERV® Powers Vision and Renewal for 2026

From academic research to public engagement, LISTSERV supports long-term vision and renewal by enabling independent,

January 13, 2026

MATLF Launches New ‘Acce$$ Loan’ to Expand Financial Access and Independence for Michiganders with Disabilities

MATLF Launches New ‘Acce$$ Loan’ to Expand Financial Access and Independence for Michiganders with Disabilities

MATLF is proud to be a part of freedom from predatory lending for people who often can least afford the cost when

January 13, 2026

Charleston’s Rooter Man Plumbing Reflects on Another Successful Year of Continued Service

Charleston’s Rooter Man Plumbing Reflects on Another Successful Year of Continued Service

Rooter Man Plumbing reflects on another successful year servicing Charleston, highlighting consistent demand, customer

January 13, 2026

Fountainhead Control Rooms Promotes Noah White to Director of Control Room Solutions

Fountainhead Control Rooms Promotes Noah White to Director of Control Room Solutions

Fountainhead Control Rooms, provider of mission-critical control room design / integration services, appointed Noah

January 13, 2026

Boostr and Passendo Announce Strategic Partnership to Unify Ad Management for Publishers’ High-Value Email Inventory

Boostr and Passendo Announce Strategic Partnership to Unify Ad Management for Publishers’ High-Value Email Inventory

Boostr and Passendo partner to unify email and display ad management, allowing publishers to forecast and execute

January 13, 2026

Pueblo County Becomes First in Colorado to Use Blitz AI to Accelerate Building Permit Reviews

Pueblo County Becomes First in Colorado to Use Blitz AI to Accelerate Building Permit Reviews

Colorado adopts AI to accelerate building permits, improve compliance, predictability, and housing delivery, thanks to

January 13, 2026

The Designers Collaborative, a designer focused buying program is sharing ways to increase profits in the New Year

The Designers Collaborative, a designer focused buying program is sharing ways to increase profits in the New Year

The Designers Collaborative unlocks ‘stocking dealer pricing’ for members of their innovative designer focused buying

January 13, 2026

QNA Drives Global Collaboration on Digital Payment Security

QNA Drives Global Collaboration on Digital Payment Security

DUBAI, UNITED ARAB EMIRATES, January 13, 2026 /EINPresswire.com/ — Following phenomenal success in cities such as

January 13, 2026

Allocore Announces Advisory Board to Accelerate Modernization of Federal Lending Systems

Allocore Announces Advisory Board to Accelerate Modernization of Federal Lending Systems

Former Congressional and Treasury leaders bring deep federal credit, policy, & oversight expertise to support

January 13, 2026

Infinite Banking in Canada presents New Opportunities for Owner Financing

Infinite Banking in Canada presents New Opportunities for Owner Financing

Canadian entrepreneurs want control, certainty, and tax efficiency. Infinite Banking gives business owners a

January 13, 2026

Creatium Solves AI’s ‘Math Problem’ : Delivering Accurate Learning Content from Algebra to Accounting

Creatium Solves AI’s ‘Math Problem’ : Delivering Accurate Learning Content from Algebra to Accounting

"AI can't do math" is no longer true. Our team has been using Creatium Studio's math capabilities to create interactive

January 13, 2026

Britive Sets the Identity Security Standard for CallSine’s Autonomous Agents

Britive Sets the Identity Security Standard for CallSine’s Autonomous Agents

Britive integrates with CallSine to enforce identity governance and secure access across autonomous multi-agent AI

January 13, 2026

Pro Haul & Services Brings Fast, Friendly and Affordable Junk Removal to the Cincinnati Area

Pro Haul & Services Brings Fast, Friendly and Affordable Junk Removal to the Cincinnati Area

CINCINNATI, OH, UNITED STATES, January 13, 2026 /EINPresswire.com/ — ProHaul & Services, a locally owned junk

January 13, 2026

Poppins Wins Consumer Health & Wellness Award at 2025 Health Tech Challengers

Poppins Wins Consumer Health & Wellness Award at 2025 Health Tech Challengers

Award Highlights Growing Momentum Behind Accessible, High-Quality Virtual Pediatric Care NEW YORK, NY, UNITED STATES,

January 13, 2026

Author Theodore A. Anderson Releases New Science Fiction Epic Set After World War III

Author Theodore A. Anderson Releases New Science Fiction Epic Set After World War III

JOHNSON CITY, TN, UNITED STATES, January 13, 2026 /EINPresswire.com/ — Author T.A. Anderson has released a new science

January 13, 2026

FireCreek Snacks Expands Retail Presence Across New York City with New Placements in Manhattan and Brooklyn Neighborhood Markets

FireCreek Snacks Expands Retail Presence Across New York City with New Placements in Manhattan and Brooklyn Neighborhood Markets

Boise, Idaho – FireCreek Snacks, a premium protein snack brand, is continuing its retail expansion in New York City

January 13, 2026