◈ X-Research
← Category Events
What happened

Cognition released Devin 2.0 in mid-April 2026 with cloud agents that run for minutes or hours (addressing the "async valley of death" where agents would time out before complex work finished), end-to-end testing with computer use (agents submit video recordings of test runs on Linux desktops), Fast Mode (2x faster at 4x cost), and streaming thoughts for visibility into agent reasoning. Real-world deployment: over 8 months, Fortune 500 companies used Devin for COBOL legacy modernization — documenting millions of lines of code, migrating customs workflows from COBOL to AWS Lambda, refactoring tax ID logic across hundreds of programs. Developers review Devin's PRs in Cursor's Windsurf IDE, check diffs, run tests, or hand off to local agents for tweaks.

Why it matters for Seva's category

The COBOL modernization story is the clearest evidence that AI agents are being trusted with production-critical enterprise systems, not just new greenfield code. COBOL runs most of global banking and government finance — if a Fortune 500 is willing to let an AI agent refactor tax ID logic across hundreds of programs, the risk tolerance for enterprise AI has crossed a threshold. For GTM AI operators: the "video test submission" pattern is the UX innovation to watch — showing CFOs and CIOs a video of what the agent did (and how tests passed) is a form of auditability that addresses the trust barrier. Long-running agents (hours, not minutes) directly enable multi-step GTM workflows: prospect research, enrichment, sequencing, and follow-up can now run as a single uninterrupted agent session.

Content angles
https://cognition.ai/blog/devin-2 ↗ https://cognition.ai/blog/how-devin-is-modernizin… ↗