Artificial intelligence has quickly become part of the contemporary zeitgeist — yet ethical considerations around the subject remain unresolved. How many users are fully aware of what they’re signing up to?
Here, by homing in on the terms and conditions and privacy policies behind the most popular AI tools and apps, Ecommerce Platforms unpacks what you need to know when using these tools for your day-to-day business needs.
We’ve analyzed the data and personal information these tools collect (and for what purpose) to help you determine which AI tools, software, and platforms are most suitable for your intended use. We also consulted a legal expert to break down the jargon behind these tools’ terms and conditions.
When you use an AI app, you consent to (at least some of) your data being collected by it
We analyzed the Apple App Store privacy labels of around 30 popular AI tools with mobile app versions to understand which ones collect your data, and why.
The data collected from users (and its purpose) is divided into 14 categories, making it possible to establish which apps collect and track the most user data.
For further details, take a look at the methodology section at the end of this page.
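To make that tallying concrete, here’s a minimal sketch, in Python, of how an app’s privacy labels can be scored against the 14 categories. The category names match Apple’s published App Store labels; the example app and its label sets are hypothetical:

```python
# The 14 possible data categories shown on Apple App Store privacy labels:
CATEGORIES = [
    "Contact Info", "Health & Fitness", "Financial Info", "Location",
    "Sensitive Info", "Contacts", "User Content", "Browsing History",
    "Search History", "Identifiers", "Purchases", "Usage Data",
    "Diagnostics", "Other Data",
]

# Hypothetical privacy labels for a made-up app, keyed by purpose:
labels = {
    "Third-Party Advertising": {"Identifiers", "Usage Data"},
    "Developer's Advertising or Marketing": {"Contact Info"},
    "Analytics": {"Usage Data", "Diagnostics"},
}

for purpose, collected in labels.items():
    share = len(collected) / len(CATEGORIES)
    print(f"{purpose}: {share:.0%} of the 14 categories")
```

This is, in essence, how the percentages in the tables below are derived: the number of categories an app tracks for a given purpose, divided by the 14 possible.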
What data do these AI apps collect?
The AI tools assessed in this research collect data of various types. Some of these focus on personal details about users — from screen names and bank details, to their health and fitness, and even sensitive information such as race/ethnicity, sexual orientation, gender identity, and political opinions.
Others relate to content created by users (like emails and messages, photos, videos, and sound recordings), or how users interact with the product itself, like their in-app search histories or what adverts they’ve seen. More impersonal still is the diagnostic information collected to show crash data or energy use.
Why do these AI apps collect data?
There are different reasons why apps collect data, some of which may be seen as more justifiable than others. For example, biometrics or contact information can be used to authenticate a user’s identity.
Similarly, access to certain data may be required for an app to function correctly, including to prevent fraud or improve scalability and performance.
More specifically, messaging apps need access to contacts, phone cameras, and microphones to allow calls, while geolocation is necessary for taxi or delivery apps.
Arguably less essential reasons to collect data include advertising or marketing by the app’s developer (for example, to send marketing communications to your users); enabling third-party advertising (by tracking data from other apps to direct targeted ads at the user, for instance); and analyzing user behavior for purposes including assessing the effectiveness of existing features or planning new ones.
AI apps that collect your data to share with third-party advertisers
The data shared falls within up to eight of the 14 categories: browsing history, contact info, identifiers, location, other data, purchases, search history, and usage data. The table shows how many of the 14 possible categories each app shares, and how many individual data points that covers.

| AI app | Categories shared (of 14) | % of data categories shared with others | No. of data points collected |
|---|---|---|---|
| Canva | 5 | 36% | 8 |
| Duolingo | 5 | 36% | 7 |
| Google Assistant | 3 | 21% | 3 |
| Bing | 2 | 14% | 2 |
| Pixai | 2 | 14% | 2 |
| Wombo | 2 | 14% | 2 |
| ChatGPT | 1 | 7% | 1 |
| Genie AI | 1 | 7% | 1 |
| Lensa | 1 | 7% | 1 |
| Speechify | 1 | 7% | 1 |
| StarryAI | 1 | 7% | 1 |
Of all the AI apps included in our research, Canva, a graphic design tool, collects data for third-party advertising across more categories than any other: five of the 14 (around 36%). By contrast, the five apps that collect the least data for this purpose each track just one category (around 7%).

The data that Canva’s app collects and shares with third parties includes your search history, location, email address, and the other information shown in the table above.

The gamified language-learning app Duolingo matches Canva’s 36%, though across seven data points rather than eight, followed by Google Assistant (around 21%) and Microsoft’s Bing (around 14%), all of which also share your data with third parties.

Of the five apps that collect the least data, only StarryAI (an image generator) confines itself to sharing usage data alone.
AI apps that collect your data for their own benefit
Here the data collected falls within up to seven of the 14 categories: browsing history, contact info, identifiers, location, purchases, search history, and usage data.

| App | Categories collected (of 14) | % of data categories collected for app’s own benefit | No. of data points collected |
|---|---|---|---|
| Canva | 6 | 43% | 9 |
| Facetune | 5 | 36% | 14 |
| Amazon Alexa | 5 | 36% | 10 |
| Google Assistant | 5 | 36% | 8 |
| PhotoRoom | 4 | 29% | 4 |
| Duolingo | 3 | 21% | 4 |
| StarryAI | 2 | 14% | 3 |
| Bing | 2 | 14% | 2 |
| Lensa | 2 | 14% | 2 |
| Otter | 1 | 7% | 2 |
| Youper | 1 | 7% | 1 |
| Poe | 1 | 7% | 1 |
| Pixai | 1 | 7% | 1 |
| Speechify | 1 | 7% | 1 |
| Wombo | 1 | 7% | 1 |
Canva also tops the chart for AI apps collecting user data for their own advertising or marketing purposes, gathering data across six of the 14 categories (around 43%).

In third place, Amazon Alexa collects data across five categories (around 36%) for the same purpose. This includes your email address, physical address, phone number, search history, and purchase history, plus five other data points. Google Assistant matches that percentage, though across eight individual data points compared to the ten that Amazon Alexa collects.

The text-to-speech voice generator Speechify is among the apps that collect the least data. According to the privacy labels on its Apple App Store listing, Speechify collects just one data point for its own benefit: your device ID.
AI apps that collect your data for any purpose
Here all 14 categories are in play: browsing history, contact info, contacts, diagnostics, financial info, health & fitness, identifiers, location, other data, purchases, search history, sensitive info, usage data, and user content.

| App | Categories collected (of 14) | % of data categories collected | No. of data points collected |
|---|---|---|---|
| Amazon Alexa | 13 | 93% | 116 |
| Google Assistant | 12 | 86% | 58 |
| Duolingo | 11 | 79% | 60 |
| Canva | 9 | 64% | 53 |
| Otter | 8 | 57% | 40 |
| Poe | 8 | 57% | 25 |
| Facetune | 7 | 50% | 64 |
| Bing | 7 | 50% | 20 |
| DeepSeek | 7 | 50% | 16 |
| Mem | 6 | 43% | 32 |
| ELSA Speak | 6 | 43% | 23 |
| PhotoRoom | 6 | 43% | 20 |
| Trint | 6 | 43% | 11 |
| ChatGPT | 5 | 36% | 26 |
| Perplexity AI | 5 | 36% | 21 |
All AI models require some form of training through machine learning — meaning that they need data.
If we want AI tools to improve and become more useful, providing this data can be seen as a necessary trade-off against our privacy.
However, the question of where the line between utility and exploitation should be drawn, and why, is a thorny one.
Given its current notoriety, it’s worth addressing DeepSeek. Its listing on the Apple App Store states that DeepSeek doesn’t collect user data for its own benefit (for example, DeepSeek’s own marketing and advertising) or to share with third parties.
The DeepSeek app itself collects data across half of the 14 categories (50%), in service of DeepSeek’s analytics and app functionality. For comparison, the ChatGPT app collects data across 36%.
Some media outlets have reported concerns about security risks related to DeepSeek’s Chinese origins (both in terms of data collection and the possible spread of misinformation), as well as about its undercutting of US rivals. Neither concern is likely to be allayed by DeepSeek’s Terms and Conditions and Privacy Policy, which would take around 35 minutes to read and are rated as “very difficult” on the Flesch reading-ease scale.
Regardless of how your data is used, Amazon Alexa collects more of it than any other AI app included in this research: 13 of the 14 possible categories (93%), or 116 individual data points, primarily contact info, user content, and usage data.

Google Assistant comes next, collecting across 86% of categories, followed by Duolingo at 79%.
At the other end of the scale, the AI image generator Stable Diffusion collects no data at all, according to the privacy labels on its Apple App Store listing.
While it’s true that all generative AI models require massive amounts of data to be trained, this training happens prior to the development of specific apps. In most cases, app creators don’t own the AI models they use; user data collection therefore relates to the functionality of the app itself. This may explain why some of the apps we investigated don’t appear in the table above.
Now, let’s look at the legal documentation behind different AI tools to find out how easy or difficult it is to read. This is based on the Flesch reading-ease test.

This system equates texts to US school reading levels (from fifth to 12th grade), then College, College Graduate, and Professional. Texts at the sixth-grade level are defined as “conversational English for consumers”, whereas Professional-rated texts are described as “extremely difficult to read”.

The lower the reading-ease score, the harder the text is to read.
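For the curious, the reading-ease score is computed from average sentence length and average word length in syllables. Here’s a minimal sketch of the standard Flesch formula in Python; the vowel-group syllable counter is a rough stand-in for the dictionary-based counters that real readability tools use:

```python
import re


def count_syllables(word: str) -> int:
    """Rough syllable estimate: count runs of consecutive vowels."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def flesch_reading_ease(text: str) -> float:
    """Standard Flesch reading-ease formula:
    206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words).
    Higher scores are easier to read."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    return 206.835 - 1.015 * (len(words) / len(sentences)) - 84.6 * (syllables / len(words))


print(round(flesch_reading_ease(
    "We analyzed the privacy policies behind popular AI tools. Many are hard to read."
), 1))
```

Scores from this formula map onto the bands described above: 90 and up reads at roughly fifth-grade level, while anything below 30 falls into College Graduate or Professional territory.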
Tellingly, when small-business owners were polled about the pressures on their time, ‘getting enough sleep’, crucial for physical and mental health and cognitive function, ranked third, trailing ‘working long hours’ and ‘sorting out tax returns’.

A third of those polled felt it wasn’t possible to do all of their admin during working hours, and said they would need four extra hours a day to get through it all.

This gives a sense of how punishing it can be to run an SME, and of how hard it is to find the time needed to read the terms and conditions behind the tools these businesses rely on.
In this context, the roughly 40-minute read times of the T&Cs for transcription tools like Otter, Trint, and Descript are highly consequential, as the quick arithmetic below shows.
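Read-time figures like these are typically derived from a document’s word count and an average reading speed. A minimal sketch, assuming a silent-reading speed of around 240 words per minute (our assumption; published estimates range from roughly 200 to 260 wpm), with an illustrative word count:

```python
AVERAGE_WPM = 240  # assumed average adult silent-reading speed


def estimated_read_minutes(word_count: int, wpm: int = AVERAGE_WPM) -> float:
    """Estimate reading time in minutes from a word count."""
    return word_count / wpm


# A terms-and-conditions document of roughly 9,600 words:
print(f"{estimated_read_minutes(9_600):.0f} minutes")  # -> 40 minutes
```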
And that’s assuming it’s possible to understand the hardest-to-read terms and conditions at all, which is why we sought out legal expertise.
We asked a legal expert in AI and tech to read them and explain key points you need to know
Josilda Ndreaj, a legal professional and licensed attorney, has navigated complex legal matters on behalf of Fortune 500 clients and provided counsel to various corporations.
More recently, as an independent consultant, she has focused on intellectual property law at the intersection of blockchain technology and artificial intelligence.
Josilda Ndreaj (LLM) is a legal professional and licensed attorney with expertise in Intellectual Property (IP) law.
Her career as a legal consultant began in a prestigious international law firm catering to Fortune 500 clients, where she navigated complex legal matters and provided counsel to a range of corporations.

Driven by an interest in innovation, creativity, and emerging technologies, Josilda then moved into independent consultancy, focusing on intellectual property law at its intersection with blockchain technology and artificial intelligence.

Josilda holds two Master of Laws degrees: one specializing in civil and commercial law from Tirana University, and the other focusing on intellectual property law from Zhongnan University of Economics and Law.
As such, Josilda was uniquely positioned to review a selection of these AI tools’ legal documents, pulling out key points for the benefit of those of us who don’t hold two Master of Laws degrees.
Her summaries are outlined below:
Gemini

Plagiarism and copyright infringement
Gemini (formerly Bard) has no obligation to declare its training sources, so we can’t check whether it was trained on copyrighted materials. Nor is Gemini excluded from liability for such infringement; if a copyright owner files a claim, Gemini bears some responsibility. But it’s important to note that Gemini is also trained on what the user gives it, and for this it requires a license from the user. If the user grants that license without actually owning the copyright, the responsibility shifts to the user.

Users retain ownership of their inputs and prompts, but output ownership is more complex, and Gemini doesn’t make it clear in its Terms. Many jurisdictions don’t recognize intellectual property rights for machines, yet it’s also questionable to argue that the output is “human-generated,” even if the user owns the input.

Business owners should never publish output from Gemini without cross-referencing it, reviewing it for updates, and checking it with experts for accuracy. Otherwise, they run the risk of publishing misleading information, which may carry reputational or legal consequences.
Security and confidentiality
Google (the owner of Gemini) provides no information in its Privacy Policy on how it handles confidential data.
Usage
Google says nothing in its Terms of Service about whether content generated by Gemini can be used for commercial purposes. It explains restrictions for things like intellectual property rights, but nothing specific to AI-generated content.
ChatGPT

Plagiarism and copyright infringement
Currently, no legislation requires ChatGPT to publicly declare what its model is trained on. So, because it doesn’t reveal its sources, we can’t know if ChatGPT delivers or processes content that is protected by copyright laws. If someone identifies copyrighted content from ChatGPT, they can make a claim to remove that content.
Users should verify all information from ChatGPT, because ChatGPT bears no responsibility for providing accurate, up-to-date content. According to its Disclaimer of Warranty section, the user takes on all risks around accuracy, quality, reliability, security, and completeness. So always verify ChatGPT’s facts: cross-reference, review for updates, and check with experts. Business owners may face legal or reputational consequences if they don’t verify ChatGPT content for accuracy before publication.
Security and confidentiality
ChatGPT collects information from inputs, including personal information, potentially to train its models (according to the Privacy Policy). Users can opt out. The situation changes when data is submitted through API connections (ChatGPT Enterprise, Team, etc.): ChatGPT doesn’t use inputs from business customers to train its models. ChatGPT has security measures in place, but doesn’t explicitly address responsibility for a security breach; that depends on regional laws.
Usage
ChatGPT users own their input and output content; they must therefore ensure that the content doesn’t violate any laws. Users can’t claim the content is human-generated, but they don’t have to declare it as AI-generated either. As long as users follow regional laws and the Terms of Use, ChatGPT content can be used for commercial purposes, on social media, in paid advertising, and on other channels. It’s still advisable to fact-check, make references, and abide by laws before publishing content from ChatGPT.
DeepSeek

Plagiarism and copyright infringement
Neither the Privacy Policy nor the Terms specify whether DeepSeek’s AI tool has been trained on copyrighted materials. What’s more, they also provide no guarantees that the outputs will not infringe on anyone’s copyright. DeepSeek’s Terms of Use state that users retain rights to their inputs (prompts), but this doesn’t necessarily imply that they’re copyright protected in the first place, so users should take steps to ensure that what they’re using as prompts isn’t someone else’s intellectual property.
Security and confidentiality

DeepSeek’s Privacy Policy explains that user inputs are processed to generate outputs, but also to improve DeepSeek’s service. This includes ‘training and improving [their] technology’. Users should therefore be cautious about inputting sensitive information; although DeepSeek has ‘commercially reasonable’ measures in place to protect the data and information used as inputs, it offers no absolute guarantees. DeepSeek’s terms state that it doesn’t publish inputs or outputs in public forums, but some may be shared with third parties.
Usage
Any content that users generate through DeepSeek can be used for commercial purposes, but because of gray areas around plagiarism and accuracy, users should take steps to verify the content before using it in this way. DeepSeek’s Terms of Service don’t reference any limitation regarding where in the world users can publish this content, but they clearly state that users must declare it as AI-generated ‘to alert the public to the synthetic nature of the content’.
DALL-E

Plagiarism and copyright infringement
Like ChatGPT, DALL-E doesn’t declare the sources of its model training. If you find copyrighted content, however, you can submit a claim for its removal. It’s difficult to check whether DALL-E infringes a copyright, since no legislation requires DALL-E to reveal its data sources. According to the Terms, user input can be used to train DALL-E’s model, even if it’s copyrighted content, though the user may opt out of this.
Security and confidentiality

DALL-E’s Privacy Policy and Terms and Conditions never explicitly address responsibility in the event of a security breach, though DALL-E does have security measures in place. Who bears responsibility in the event of a hack depends on regional laws.
Usage

You can use DALL-E for commercial purposes, as long as you follow all laws and DALL-E’s Terms. Regulations may change but, at the time of writing, users are welcome to publish DALL-E content on social media, in advertisements, and on other channels. Users should always make proper references and fact-check for accuracy to avoid violating any laws.
Bing AI

Plagiarism and copyright infringement
Microsoft has no obligation to share Bing AI’s training sources, making it very difficult to determine whether Bing AI inadvertently uses copyrighted content. Although such content is tricky to identify, users can still make claims on it. The Microsoft Services Agreement says Bing AI uses user inputs and outputs to improve its model, but there’s nothing formal in place to prevent intellectual property theft.
Security and confidentiality

Microsoft’s AI tools (including Bing AI) use personal and confidential user data to train their models. Its Services Agreement doesn’t cover AI-generated content; instead, it tries to shift all responsibility for AI content onto the user. Microsoft also assumes no responsibility for its customers’ privacy and security practices. In short, if your data gets breached while using Bing AI, that’s your problem, not Microsoft’s.
Usage
Microsoft does not claim ownership of user content, but it doesn’t specifically regulate AI-generated content, where ownership is uncertain. The Services Agreement lets people use content for commercial purposes, with some significant stipulations: you must accept that AI-generated content lacks human creativity, so it can’t be claimed as intellectual property, and you must not infringe the intellectual property rights of others. In short, you can’t use other people’s intellectual property, but whatever you make with Bing is probably not your own intellectual property either.
Quillbot

Plagiarism and copyright infringement
Quillbot has no obligation to reveal the sources it uses to train its models. Interestingly, though, the company tries to regulate one unique situation: what if the source of model training is the AI’s own output? Quillbot essentially attempts to minimize the potential for copyrighted output, but states there’s still a chance that output is copyrighted if users input copyrighted content. To make things more confusing, Quillbot tries to cover all bases by saying users grant it an unlimited, sub-licensable license while also claiming that users own all of their outputs.
Security and confidentiality

Quillbot has measures to protect user privacy, but it may still end up processing personal data, and there are special protections for children’s privacy. Responsibility for data loss from a hack is handled case by case. Quillbot states that users should take steps to prevent their personal data from being hacked, and that it has data protection measures in place.
Usage
Quillbot users can publish generated content for commercial purposes, but you may need to follow some rules, like not publishing harmful or misleading content. Quillbot’s Terms don’t say that you need to declare its content is AI-generated. In short, the user can publish content generated by Quillbot as long as it doesn’t violate any laws or rights.
Pixlr

Plagiarism and copyright infringement
Pixlr doesn’t reveal the sources of its AI model training, since there’s no legal obligation for it to do so. Its Terms state that the user owns the content, but users also grant Pixlr a license to use that content, in an attempt to minimize the use of copyrighted material.

Security and confidentiality

Pixlr uses user inputs for AI model training, and passes the burden to users to be careful about entering personal or confidential information. Pixlr waives responsibility for filtering such information out of its training data, though it does apply some filters to block personal or confidential information. Pixlr claims no liability for security issues caused by users’ own actions.
Usage
Users can publish AI-generated content made through Pixlr for commercial purposes (though some conditions apply). The Terms don’t require you to state anything is AI-generated. Users are still liable for violating rights or laws, though.
Midjourney

Security and confidentiality

Midjourney trains its model on user inputs, even when they include personal or confidential data. Midjourney’s position is that users should be careful with sensitive data, so it’s not the company’s problem. The company attempts to filter out certain information for model training, but it isn’t required to. Midjourney claims no responsibility for security issues that may arise from a user’s actions.
Usage
Midjourney users can publish generated content for commercial purposes. Some conditions apply, like the requirement to subscribe to the Pro version if your company makes more than $1M per year. At the time of writing, users don’t have to declare that anything is AI-generated from Midjourney, though legislation is in motion to change this. Users can generally use any Midjourney content that doesn’t violate any rights or laws.
Clipchamp

Usage

Clipchamp and Microsoft steer away from regulating AI-generated content, never claiming that Microsoft owns it. Technically, Microsoft says the user owns the content, but without the intellectual property rights. The user can publish Clipchamp content for commercial purposes with two stipulations: you can’t infringe on intellectual property rights, and you can’t claim intellectual property rights over generated content.
Looka

Plagiarism and copyright infringement
The Looka Terms state that the company has no obligation to share its data training sources, so it doesn’t. Users bear all risks when using Looka-generated content.
Accuracy and reliability
Looka accepts no responsibility for the accuracy and reliability of the output from its AI tools. Users should verify all facts and check for reliability.
Usage

Looka users may use AI-generated content for commercial purposes, but they may need to follow conditions or pay a fee. Users don’t have to label their generated content as AI-generated, but they should avoid publishing generated content that violates rights or laws.
Speechify

Plagiarism and copyright infringement
It’s not possible to tell whether Speechify trains its model on copyrighted materials; we simply don’t know. Speechify’s Terms recommend against using copyrighted material as input, which suggests some outputs may contain copyrighted material. Speechify claims to bear no responsibility for this.
Accuracy and reliability
According to its Terms, Speechify takes no responsibility for the accuracy of its outputs. Users should always check Speechify’s output for timeliness, reliability, and accuracy.
Kapwing

Security and confidentiality

Kapwing users take on all risks when choosing to input confidential information into its AI tool. Kapwing also offers no warranty over, and takes no responsibility for, the security of the service.
Usage
You can publish Kapwing content commercially, but Kapwing advises users to be cautious. Its terms don’t say whether users must declare that output from Kapwing is AI-generated.
Disclaimer
This information is for general information purposes only and should not be taken as legal advice. Ecommerce Platforms assumes no responsibility for errors or omissions. Consult a suitable legal professional for advice and guidance tailored to your specific needs and circumstances.
Conclusion
The ubiquity of AI makes it ever more likely that we’ll all use tools and apps built on this technology, yet many of us don’t have the luxury of the time needed to read their terms and conditions.

Given how many AI T&Cs received low readability ratings in our research, it seems the impenetrable legalese of these documents puts users off even attempting to understand them.
We worked with a legal professional to parse the documents for us, but it’s questionable whether this should be necessary.
We hope that this research — including its readability ratings, and Josilda Ndreaj’s expertise on the terms and conditions to be mindful of — will help guide your choices of which AI apps and tools to engage with.
Methodology and Sources
How we conducted the research
Starting with a seed list of around 90 AI tools and apps, we first gathered each tool’s legal documentation, from terms and conditions to privacy policies. We then recorded the word counts of these documents and calculated their readability scores using the Flesch reading-ease test. Next, we enlisted the help of a legal expert, Josilda Ndreaj (LLM), who reviewed a selection of these legal documents and identified key points that users should be aware of.
For around 30 of the AI tools that have mobile app versions available, we searched each on the Apple App Store and recorded their privacy labels shown on their listings. These are divided into 14 categories of data that can be collected from users, and for what purpose. To calculate which AI apps collected the most data, we measured how many of the 14 possible categories these apps tracked their users across.
It’s important to note that these 14 categories are divided further into individual data points. For example, ‘Contact Info’ includes five data points: ‘Name’, ‘Email Address’, ‘Phone Number’, ‘Physical Address’, and ‘Other User Contact Info’. To find out which apps collect the most individual data points, take a look at the last column in each of the tables.
Some apps collect more individual data points than apps that appear above them in the ranking. This is because our ranking considers how many of the 14 categories an app collects data across overall, suggesting a broader and therefore more ‘complete’ picture of user data, rather than the depth of information collected in each category; the sketch below illustrates the sorting logic.
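As a hedged illustration of that ranking logic (the figures are taken from the tables above, but the data structure and code are our own, for demonstration only):

```python
# Rank apps by breadth (categories tracked, out of 14), not depth (data points).
TOTAL_CATEGORIES = 14

apps = {
    "Amazon Alexa": {"categories": 13, "data_points": 116},
    "Duolingo": {"categories": 11, "data_points": 60},
    "Facetune": {"categories": 7, "data_points": 64},  # more points than Duolingo, fewer categories
}

ranked = sorted(apps.items(), key=lambda kv: kv[1]["categories"], reverse=True)
for name, stats in ranked:
    share = stats["categories"] / TOTAL_CATEGORIES
    print(f"{name}: {share:.0%} of categories, {stats['data_points']} data points")
# Facetune ranks below Duolingo despite collecting more individual data points.
```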
Sources
Apple App Store pages for each app, accurate as of February 2025.
Various documentation for each AI app (including terms and conditions and privacy policies) accessed and reviewed by Josilda Ndreaj in February 2024, except DeepSeek, which was accessed and reviewed in January 2025.
Flesch-Kincaid Readability calculator.
Word count scraper.
Various roundups of AI apps and tools, used to inform the initial seed list.
Correction requests
We periodically update this research.
If you are the owner of any of the AI tools included in this research and you’d like to challenge the information on this page, we’re willing to update it subject to our review of the evidence you provide. When contacting us in this regard, we kindly ask for:
business documents verifying your legitimacy (for example, incorporation certificate or registration documents)
the information on this page you believe to be outdated (please be specific)
how it should be updated and why, with links to documentation that backs this up (for example, amendments to Terms of Service)
Please contact us at [email protected] with the subject line: ‘Correction request: AI tools study’, plus the name of the AI tool you’re contacting us about.