Tech

White Hat Hackers Discover Microsoft Leak of 38TB of Internal Data Via Azure Storage

Byadmin September 19, 2023

The Microsoft leak, which stemmed from AI researchers sharing open-source training data on GitHub, has been mitigated.

Microsoft has patched a vulnerability that exposed 38TB of private data from its AI research division. White hat hackers from cloud security company Wiz discovered a shareable link based on Azure Statistical Analysis System tokens on June 22, 2023. The hackers reported it to the Microsoft Security Response Center, which invalidated the SAS token by June 24 and replaced the token on the GitHub page, where it was originally located, on July 7.

Jump to:

SAS tokens, an Azure file-sharing feature, enabled this vulnerability

The hackers first discovered the vulnerability as they searched for misconfigured storage containers across the internet. Misconfigured storage containers are a known backdoor into cloud-hosted data. The hackers found robust-models-transfer, a repository of open-source code and AI models for image recognition used by Microsoft’s AI research division.

The vulnerability originated from a Shared Access Signature token for an internal storage account. A Microsoft employee shared a URL for a Blob store (a type of object storage in Azure) containing an AI dataset in a public GitHub repository while working on open-source AI learning models. From there, the Wiz team used the misconfigured URL to acquire permissions to access the entire storage account.

When the Wiz hackers followed the link, they were able to access a repository that contained disk backups of two former employees’ workstation profiles and internal Microsoft Teams messages. The repository held 38TB of private data, secrets, private keys, passwords and the open-source AI training data.

SAS tokens don’t expire, so they aren’t typically recommended for sharing important data externally. A September 7 Microsoft security blog pointed out that “Attackers may create a high-privileged SAS token with long expiry to preserve valid credentials for a long period.”

Microsoft noted that no customer data was ever included in the information that was exposed, and that there was no risk of other Microsoft services being breached because of the AI data set.

What businesses can learn from the Microsoft data leak

This case isn’t specific to the fact that Microsoft was working on AI training — any very large open-source data set might conceivably be shared in this way. However, Wiz pointed out in its blog post, “Researchers collect and share massive amounts of external and internal data to construct the required training information for their AI models. This poses inherent security risks tied to high-scale data sharing.”

Wiz suggested organizations looking to avoid similar incidents should caution employees against oversharing data. In this case, the Microsoft researchers could have moved the public AI data set to a dedicated storage account.

Organizations should be alert for supply chain attacks, which can occur if attackers inject malicious code into files that are open to public access through improper permissions.

SEE: Use this checklist to make sure you’re on top of network and systems security (TechRepublic Premium)

“As we see wider adoption of AI models within companies, it’s important to raise awareness of relevant security risks at every step of the AI development process, and make sure the security team works closely with the data science and research teams to ensure proper guardrails are defined,” the Wiz team wrote in their blog post.

TechRepublic has reached out to Microsoft and Wiz for comments.

Tech

Amazon Prime Day October 2023: 5 Best Laptop Deals

Check out our roundup of the best Amazon Prime Day laptop deals and discounts to help you find the perfect device at a great price. Amazon Prime’s Big Deal Days 2023 is finally here, and it’s the perfect opportunity to get your hands on excellent laptops at competitive prices. Whether you are looking for a…

Tech

7 Best B2B Database Providers for 2024

B2B database providers are companies that offer a range of data services like workflow management, data enrichment and sales intelligence through an inhouse platform or web integrations. They source leads that match your clientele and build databases with their most accurate contact information and sales predictions. Top B2B database providers comparison Businesses of any size…

Tech

This beginner-friendly ethical hacker training is 97% off

शुरुआती से एथिकल हैकिंग सर्टिफिकेशन तक हैक कैसे करें आपको सिखाएगा कि आप अपने सिस्टम की सुरक्षा कैसे करें और शीर्ष ग्राहकों का विश्वास कैसे अर्जित करें। छवि: स्टैककॉमर्स इससे पहले कि कोई व्यवसाय आपको अपने आईटी सिस्टम पर काम करने के लिए किराए पर लेगा, उन्हें भरोसा करना होगा कि आप उनके डेटा को…

Tech

How to Expand Your Value as CIO in 4 Ways

A Gartner analyst details how CIOs can deliver executive leadership and expand their responsibilities in four ways in the next era of digitalization. Image: iStockphoto/fizkes In the early days of digitalization, CIOs recognized the potential value of advancing technology and led their peers and stakeholders as evangelists, extolling new opportunities and business models through digital….

Tech

Best Accounting Project Management Software for 2023

Project management (PM) software is a universal need to keep track of goals, research, data, scheduling and everything else under the sun. If you have more specific needs, however, you may have to find a PM solution with a specialized focus. Accounting project management software can simultaneously help teams perfect their finances and exceed business…

Tech

IT and Security Pros Are ‘Cautiously Optimistic’ About AI

Google क्लाउड द्वारा कमीशन क्लाउड सिक्योरिटी एलायंस की एक रिपोर्ट के अनुसार, सी-सूट अपने आईटी और सुरक्षा कर्मचारियों की तुलना में एआई प्रौद्योगिकियों से अधिक परिचित है। 3 अप्रैल को प्रकाशित रिपोर्ट में बताया गया है कि क्या आईटी और सुरक्षा पेशेवरों को डर है कि एआई उनकी नौकरियों की जगह ले लेगा, जेनेरिक एआई…

SAS tokens, an Azure file-sharing feature, enabled this vulnerability

What businesses can learn from the Microsoft data leak

Similar Posts