Yikes, Internal Microsoft Data Leaked While Sharing AI Training Data

Martina Bretous

Ever show someone a specific picture in your Photos album and they start swiping left and right, seeing things they weren’t supposed to? That’s kind of what happened with Microsoft last week.


Their AI research team published AI training data on GitHub, a cloud-based platform where developers store and manage code. Turns out, they also granted public access to 38 terabytes of sensitive information.

Here’s how the error was discovered.


During a routine scan for exposed data, the research team at Wiz, a cloud security startup, found Microsoft's GitHub repository, which included a URL for accessing and downloading AI models for image recognition.

Unbeknownst to Microsoft, the link also granted users access to the entire storage account, including more than 30k internal Teams messages from over 300 employees, passwords, secret keys, computer backups, and other personal data.

[Image: Microsoft's leaked internal Teams messages, exposed through the GitHub repository]

But that's not all: users also had full control over the data itself, meaning they could delete or overwrite files. Any bad actor could have injected malicious code into the models, creating a domino effect impacting every user who downloaded them. Big yikes.

To understand how it happened, we have to get technical for a second.

Azure is Microsoft's cloud computing platform, and its storage service uses Shared Access Signature (SAS) tokens. Think of these tokens as keys that grant access to Azure storage resources, with customizable permissions and expiration dates.

In Microsoft’s case, a token was accidentally included in this publicly accessible URL.
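To make the mechanics concrete, here's a minimal sketch in Python using the azure-storage-blob SDK. The storage account, key, and container names are hypothetical placeholders, and this token is deliberately scoped to read/list with a one-hour expiry. The leaked token was far broader, but the principle is the same: whoever holds the URL gets whatever permissions and lifetime were baked into the token when it was signed.

```python
# Minimal sketch of how an Azure SAS token ends up inside a shareable URL.
# Account name, key, and container are hypothetical placeholders.
from datetime import datetime, timedelta, timezone

from azure.storage.blob import ContainerSasPermissions, generate_container_sas

ACCOUNT_NAME = "examplestorage"        # hypothetical storage account
ACCOUNT_KEY = "<storage-account-key>"  # never commit this
CONTAINER = "ai-training-data"         # hypothetical container

# A narrowly scoped token: read/list only, expires in one hour.
sas_token = generate_container_sas(
    account_name=ACCOUNT_NAME,
    container_name=CONTAINER,
    account_key=ACCOUNT_KEY,
    permission=ContainerSasPermissions(read=True, list=True),
    expiry=datetime.now(timezone.utc) + timedelta(hours=1),
)

# The token is just a query string. Anyone who has this URL gets the
# permissions baked into the token -- no login required.
download_url = f"https://{ACCOUNT_NAME}.blob.core.windows.net/{CONTAINER}?{sas_token}"
print(download_url)
```

The danger is that the token is self-contained: once a URL like this is pasted into a README or a public repo, there's no login step standing between it and anyone who finds it.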

Wiz alerted Microsoft to the issue on June 22, and the token was revoked two days later. Microsoft's Security Research and Defense team says customer data wasn't exposed, and neither was data from other Microsoft services.

Microsoft has also expanded GitHub's secret scanning service to monitor for exposed SAS tokens.

What’s Wiz’s recommendation? Limit the use of SAS tokens, because they’re difficult to monitor.
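For teams that keep using SAS tokens, even a naive check can catch obvious leaks before they reach a public repo. Here's an illustrative sketch (a simple heuristic, not GitHub's actual secret scanning service) that flags lines containing the sv= and sig= query parameters that Azure SAS tokens carry:

```python
# Minimal sketch: flag likely Azure SAS tokens before they reach a public repo.
# An illustrative heuristic only, not GitHub's secret scanning service.
import re
import sys
from pathlib import Path

# Azure SAS query strings include a service version (sv=) and a signature (sig=).
SAS_PATTERN = re.compile(
    r"\bsv=\d{4}-\d{2}-\d{2}\b.*?\bsig=[A-Za-z0-9%+/=]+", re.IGNORECASE
)

def scan(root: str) -> list[tuple[Path, int]]:
    """Return (file, line_number) pairs where a SAS-like token appears."""
    hits = []
    for path in Path(root).rglob("*"):
        if not path.is_file():
            continue
        try:
            text = path.read_text(errors="ignore")
        except OSError:
            continue
        for lineno, line in enumerate(text.splitlines(), start=1):
            if SAS_PATTERN.search(line):
                hits.append((path, lineno))
    return hits

if __name__ == "__main__":
    findings = scan(sys.argv[1] if len(sys.argv) > 1 else ".")
    for path, lineno in findings:
        print(f"Possible SAS token in {path}:{lineno}")
    sys.exit(1 if findings else 0)
```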

As for the bigger takeaway, this incident makes the case for AI research teams and security teams to work together more closely.

Organizations training AI models are working with far higher volumes of data than ever before. With that volume comes a need for more robust security checks to prevent breaches.

