Differential privacy explained
With iOS 10.3, Apple is hoping to start analyzing user data to improve iCloud. Typically, large internet companies like Google and Facebook want to vacuum all our data up into the cloud so they can feed their search indexes, social graphs, and artificial intelligence projects. They claim to anonymize some of it by stripping off personal identifiers but, data at sufficient volume still paints precise enough patterns that identity can be sometimes still be determined.
Apple's trying to do it in a way that maintains effective anonymity for their users. And one of the ways they're doing that is with a technique called "differential privacy".
Here's what Apple says about it:
How does differential privacy work?
Unveiled at WWDC 2016, differential privacy works by adding a level of noise to data at the point of collection — like when you add a new word to the auto-predict dictionary in the QuickType keyboard.
Once large amounts of data from large volumes of users is collected, statistical analysis is then used to "see" the patterns through the noise. It's impossible to tell who was the source of any particular piece of data, but the frequency, popularity, and other meta-information about that data becomes usable by Apple to improve services.
That's... obscure. Example, please!
Let's say Apple wanted to know whether to suggest "Wars" or "Trek" more often as an auto-complete for "Star", but they want to make sure not to start any family feuds — because sci-fan fans are passionate!
With differential privacy, when collecting the data, Apple flips a virtual coin and then every time the coin comes up "heads", records the answer incorrectly. In other words, it records a "lie". So, when recording a million answers, half of them would be "lies", and it would be impossible to know if any individual answer was the truth or a "lie".
Master your iPhone in minutes
iMore offers spot-on advice and guidance from our team of experts, with decades of Apple device experience to lean on. Learn more with iMore!
By collecting enough responses from enough people, and using statistical analysis to figure out how many lies there were, Apple could get really close to the proper answer, again, without ever knowing exactly which individuals answered which way.
Less noise could be added for areas of less popularity and frequency, more noise for areas with higher popularity and frequency. The system could automatically opt-out anyone who was contributing too much data to protect their privacy.
That's just one example of how noise and statistics can be added to protect privacy but still provide useful data. Apple wants to start using to it improve the quality of Siri responses and related services as well.
Privacy different
For end users, the benefits of crowd sourcing are tangible. If millions of people started typing or saying "shway" for "cool", Apple could quickly add it to dictionaries, without ever needing, or caring, to know whether you typed or said it or not.
In other words, differential privacy lets Apple provide most of the crowd-sourced, artificially intelligent services we want, but without the intrusive data collection practices that creep us out.
If done right, it'll be a real win/win for everyone.
○ How to use two-factor authentication
○ How to protect your data from being hacked
○ How to quickly temporarily disable Face ID
○ Best practices for staying safe on social media
○ Best VPN services
○ How to lock down your data on iPhone and iPad
○ Best ways to increase iPhone and iPad security
○ How to back up your iPhone, iPad, and Mac
○ Differential privacy — Everything you need to know!
Rene Ritchie is one of the most respected Apple analysts in the business, reaching a combined audience of over 40 million readers a month. His YouTube channel, Vector, has over 90 thousand subscribers and 14 million views and his podcasts, including Debug, have been downloaded over 20 million times. He also regularly co-hosts MacBreak Weekly for the TWiT network and co-hosted CES Live! and Talk Mobile. Based in Montreal, Rene is a former director of product marketing, web developer, and graphic designer. He's authored several books and appeared on numerous television and radio segments to discuss Apple and the technology industry. When not working, he likes to cook, grapple, and spend time with his friends and family.