
July 2024: CrowdStrike’s faulty sensor configuration update spirals into a multi-day outage— grounding airlines, emptying office buildings of their laptops, and draining as much as $5.4 billion from Fortune 500 companies. Not two weeks later, a traffic surge and vendor glitch took Charles Schwab’s and Fidelity’s trading apps offline at the height of a market sell-off, locking out tens of thousands of investors.
Vibe coding or not, enterprises everywhere are discovering that AI-native software development is introducing new classes of errors, from subtle logic flaws to algorithmic biases that only manifest in production. As software development accelerates to warp speed, resilience struggles to keep pace—and this widening gap is precisely where Lightrun thrives.
We’re excited to announce our participation in Lightrun’s $70M Series B funding round, alongside Accel, Insight Partners, Citi, Glilot Capital, and GTM Capital. This Tel Aviv-based team is revolutionizing how the world’s largest organizations respond to production issues, giving software the reflexes to fix itself before users even notice something’s wrong.
The Resilience Challenge in Enterprise Software
The collision of two seemingly contradictory trends is creating a perfect storm in software development. AI code assistants are exponentially accelerating delivery, flooding production environments with more code than ever. Meanwhile, incidents like the CrowdStrike outage remind us that software failures carry astronomical costs—both financial and reputational.
Splunk and Oxford Economics calculated that unplanned downtime costs Global 2000 businesses $400 billion annually—that represents $200 million per company each year, across lost revenue ($49M); marketing, public relations, and investor relations ($27M); regulatory fines ($22M); and more. For public cloud-deployed software, traditional observability tools help teams see problems faster, but research shows that as many as 40% of production incidents still trace back to code or configuration bugs, and nearly 80% of those are mitigated through rollbacks or operational work-arounds rather than a permanent code fix, prolonging average time-to-resolution.
Lightrun transforms this paradigm entirely. Rather than just monitoring applications and alerting teams to issues, their platform enables developers to diagnose and remediate problems in running code without stopping or redeploying it—like a surgeon performing minimally invasive procedures while the heart keeps beating. Under the hood, a three-tier architecture—IDE plug-in, in-process SDK, and policy-centric management server—keeps production traffic fast and data private.
Real-Time Remediation: Fulfilling the Promise of Observability in the AI Era
What makes Lightrun truly groundbreaking is its AI-powered live instrumentation platform, which lets developers inject logs, metrics, or traces into running code with surgical precision. Lightrun’s Autonomous Debugger takes this a step further, automatically correlating issues with precise code pathways, and suggesting fixes.
As Lightrun CEO Ilan Peleg puts it: “As autonomous software development becomes reality, we see the next frontier in autonomous remediation—software that can fix itself autonomously.”
This shift from reactive monitoring to proactive, autonomous problem resolution represents a fundamental leap forward in how organizations approach software resilience. The game is now about preventing failures from impacting users in the first place—not fixing them faster.
What Customers Told Us
“Once you use Lightrun, it becomes a must-have solution,” one enterprise customer told us. Their engineering teams can now debug in production without friction, allowing code to continue running while analyzing and resolving problems.
Enterprise users report dramatic improvements in incident response times, with organizations reducing mean time to resolution (MTTR) from hours to minutes. This acceleration happens because development teams can finally debug production issues without the traditional cycle of reproducing bugs locally or deploying speculative fixes.
The market has responded enthusiastically to this approach. Lightrun’s revenue grew 4.5× year-over-year in 2024, and their team doubled in size to support increasing global demand. Their Fortune 500 customers now include ADP, AT&T, Citi, ICE/NYSE, Inditex, Microsoft, Priceline, Salesforce, and SAP—alongside several F10 giants powering mission-critical applications worldwide.
And because the management server can run as SaaS, single-tenant, or fully air-gapped on-prem, banks and governments can adopt it without moving sensitive code off-site.
A Market Ready for Transformation
We’re particularly excited about Lightrun’s potential in heavily regulated sectors—especially banking and financial services—where downtime comes with regulatory fines and front-page headlines. Many of these organizations already trust Lightrun with their most critical systems, and the new capital will help deepen these relationships while scaling their direct and partner go-to-market strategy.
The timing couldn’t be better. As enterprises increasingly prioritize operational resilience to preserve customer trust, ensure business continuity, and maintain compliance, Lightrun’s technology addresses an urgent and growing need.
Betting on the Team
We connected with the Lightrun co-founders Ilan Peleg and Leonid Blouvshtein in 2021. They described a future where observability is table stakes and remediation is the game—and now, they are commercializing that vision in the enterprise.
This combination of execution speed, technical rigor, and obsessive customer focus convinced us that this is the team to redefine how software is maintained and repaired in the age of AI-native development.
The Future of Software Resilience
The concept of “self-healing software” has floated around whiteboards and conference rooms for years. We’re uniquely bullish on Lightrun’s credible implementation with real enterprise adoption and the resources necessary to push it into the mainstream.
As autonomous software development becomes the norm, autonomous remediation will become increasingly critical. Lightrun is pioneering this transformation today, protecting billions in transactions and laying the groundwork for a future where software systems can detect and repair issues with minimal human intervention.
We couldn’t be happier to support their mission to make Friday deployments boring again—and to give every engineering team a safety net that gets tighter each time it’s used.
If your organization is fighting fires in production or looking to cut mean time to resolution by an order of magnitude, we’d love to introduce you to the Lightrun team.