Visual Studio Team Services Stops Twice -- Visual Studio Magazine

Visual Studio Team Services Stops Twice

When your goal is 99.999-percent uptime, there's a small crack in the window for downtime. That happened with Visual Studio Team Services last week. And it happened within a span of two days.

By Michael Domingo
02/09/2016

When Microsoft's cloud services guarantee is for 99.999-percent uptime, it means the window is cracked open ever so slightly for a period of downtime. That happened with Visual Studio Team Services withiin a span of two days just recently. Microsoft's Brian Harry detailed both stoppages in a pair of detailed blog posts (here and here).

The first stoppage occurred on February 3, at 3:30 GMT, with worldwide impact to the service. This incident kept VSTS quiet for three hours. As Harry noted in a blog, "We saw a large spike in response times from less than 10% of the requests coming from the browser. At this point alerts fired and we engaged our on-call DRIs." He said that the issue centered on missing Azure DNS entries, which occurred because of a change -- which, unknowingly, introduced a bug -- in how the team performs automated cleanup of DNS entries for inactive accounts. And this problem cascaded to another separate but related incident which was "was mitigated by failing over to the secondary SQL server."

Then on Feb. 4, at 9:11 GMT, users in large numbers started reporting login issues and slowness of response from VSTS. "The root cause is that we (the Team Services team) changed the SQL Azure query processor compat level from 100 (SQL Server 2008) to 120 (SQL Server 2014) on one of the SPS databases," said Harry. The change cascaded to other problems, which started to eat up memory as queries started to come in from users. From there, he explains in a fairly detailed and visual manner the incident response timeline up to final mitigation, which spanned five hours.

What is interesting about the two incidents is the thoroughness of reporting that Microsoft provided on these stoppages. "Those two post are a fascinating insight into incident response, problem solving and lessons learned," notes one commenter named Cedric. " They make a very interesting read for anybody involved in systems architecture and incident response team.

One interesting tidbit from the incidents that Harry revealed is that the visualizations he used to illustrate the problems he described in the blogs were created by a tool that's part of Application Insights called "Kusto," at least while it's in development. Harry explains: "Pay attention at the //Build/ conference. We're going to be talking about it. We've already got many dozens, maybe hundreds of services across Microsoft using it and it is ingesting/querying ~300TB of telemetry per day and growing VERY rapidly."

About the Author

You Tell 'Em, Readers: If you've read this far, know that Michael Domingo, Visual Studio Magazine Editor in Chief, is here to serve you, dear readers, and wants to get you the information you so richly deserve. What news, content, topics, issues do you want to see covered in Visual Studio Magazine? He's listening at [email protected].

Printable Format

comments powered by Disqus

Featured

As Agentic AI Explodes, Microsoft Announces MS365 Copilot Agent Debugging

Microsoft announced agent debugging functionality for Microsoft 365 Copilot directly from the AI tool itself, no Visual Studio 2022 or Visual Studio Code needed.
Creating Business Applications Using Blazor

Expert Blazor programmer Michael Washington' will present an upcoming developer education session on building high-performance business applications using Blazor, focusing on core concepts, integration with .NET, and best practices for development.
GitHub Celebrates Microsoft's 50th by 'Vibe Coding with Copilot'

GitHub chose Microsoft's 50th anniversary to highlight a bevy of Copilot enhancements that further the practice of "vibe coding," where AI does all the drudgery according to human supervision.
AI Coding Assistants Encroach on Copilot's Special GitHub Relationship

Microsoft had a great thing going when it had GitHub Copilot all to itself in Visual Studio and Visual Studio Code thanks to its ownership of GitHub, but that's eroding.
VS Code v1.99 Is All About Copilot Chat AI, Including Agent Mode

Agent Mode provides an autonomous editing experience where Copilot plans and executes tasks to fulfill requests. It determines relevant files, applies code changes, suggests terminal commands, and iterates to resolve issues, all while keeping users in control to review and confirm actions.

Subscribe on YouTube

.NET Insight

Email Address*Country*

Please type the letters/numbers you see above.

Upcoming Training Events

0 AM

VSLive! 4-Day Hands-On Training Seminar: Hands-on with Blazor
May 5-8, 2025

Cybersecurity & Ransomware Live! VirtCon 2025
May 13-15, 2025

VSLive! 4-Hour In-Depth Workshop: Deep Dive into ASP.NET Core Razor Pages
May 29, 2025

VSLive! 3-Day Hands-On Training Seminar: Master Modern JavaScript: Unlock the Full Potential of Your Code
June 2-4, 2025

VSLive! 2-Day Hands-On Training Seminar: Asynchronous and Parallel Programming in C#
June 24-25, 2025

VSLive! 4-Day Hands-On Training Seminar: Immersive .NET Full Stack Training: 4-Day Hands-On Experience
July 15-18, 2025

Visual Studio Live! @ Microsoft HQ
August 4-8, 2025

Visual Studio Live! San Diego
September 8-12, 2025

Live! 360 2-Day Hands-On Seminar: Swimming in the Lakes of Microsoft Fabric and AI – A Hands-on Experience
September 18-19, 2025

VSLive! 2-Day Hands-On Training Seminar: Hands-On with .NET Web Development in 2025
October 7-8, 2025

Live! 360 Orlando
November 16-21, 2025

Artificial Intelligence Live! Orlando
November 16-21, 2025

Cloud & Containers Live! Orlando
November 16-21, 2025

Cybersecurity & Ransomware Live! Orlando
November 16-21, 2025

Data Platform Live! Orlando
November 16-21, 2025

Visual Studio Live! Orlando
November 16-21, 2025

VSLive! 4-Day Hands-On Training Seminar: Immersive .NET Full Stack Training: 4-Day Hands-On Experience
December 16-19, 2025

Free Webcasts

> More Webcasts