163

I'm speaking of online data harvested through apps, websites, hardware (such as phones/streaming devices).

I mean if multiple versions of the same harvested data are being sold, wouldn't the value decrease because of the competition? When it comes to aggregate data, how much financial value can there really be in knowing that a million office workers just clicked on the same cat meme?

How does the quantity of time and expense toward "personalization" not simply overshadow the return, given that no one can click on even a small percentage of those numerous ads, let alone buy the shit being advertised?

It just seems like there would come a time when the value of user data is sucked dry, or at least significantly decreased.

top 34 comments
sorted by: hot top controversial new old
[-] DozensOfDonner@mander.xyz 83 points 1 year ago

Seeing how 90% of the ads I get are reeeeeally missing the boat, they might already be over- harvesting my data?

[-] MajorHavoc@lemmy.world 35 points 1 year ago

Exactly.

Alternately, perhaps the algorithm knows me better than I know myself. Perhaps the humidifier I just bought is about to kick off a new hobby of collecting humidifiers. /s

[-] AA5B@lemmy.world 12 points 1 year ago

Yep, just like a couple years back when I bought a part for my Toyota. Maybe I was about to collect the same party for a Mazda and a GMC

[-] Papanca@lemmy.world 4 points 1 year ago

I almost spit out my tea, thanks!

[-] tooclose104@lemmy.ca 8 points 1 year ago

It's getting weird. I'm suddenly seeing a lot of things pretending to not be ads but specifically referencing my age or things generally blanketed to my age group. But weird things like life insurance and dick pills. Stuff I've never searched before because I already have employer provided insurance and my shmegan works as intended.

[-] DozensOfDonner@mander.xyz 4 points 1 year ago

Yeah I've got ads for very weird, retiree, conservative dad shirts a lot a while back, with text like "dad: I fix things and like cars" or something dumb like that.

I'm in my thirties, European, super left, don't like/never wil have kids. Wtf?

(By the way algorithms those facts are just a present, see what you wanna do with it)

[-] BitingChaos@lemmy.world 5 points 1 year ago

I'm still waiting for the spooky stuff I've been hearing about for years.

I'm being tracked. My information sold and exchanged. Big, evil corporations trading my data with other companies like it was baseball cards. All my inner-most desires leaked!

All to get an ad for a "litter box robot" or whatever when I'm browsing memes.

[-] DozensOfDonner@mander.xyz 2 points 1 year ago

I did notice that one we talked about vacuum cleaners or something, we got some ads about that. Weird i guess. But 90% of the time with household stuff like that its: research something (and start getting ads), buy something and keep getting ads for another month. And companies are posting top dollar for those ad- companies and their algorithms???

[-] thonofpy@lemmy.world 3 points 1 year ago

Y'all are getting ads?

[-] HeyThisIsntTheYMCA@lemmy.world 1 points 1 year ago

My ads think I'm an oncologist. I've been pretty good at obscuring my digital footprint but this latest bit is a little weird.

[-] DozensOfDonner@mander.xyz 2 points 1 year ago

Wait what? You get ads for oncologists? Like what? Cancer medication?

[-] HeyThisIsntTheYMCA@lemmy.world 1 points 1 year ago* (last edited 1 year ago)

Continuing education mostly. Retreats, online courses, stuff like that.

I've had a few major surgeries that are often used to treat cancer, and I to understand what I was getting myself into I basically had to put myself through an abbreviated online med school. Got actual med school syllabi, went through the relevant classes, read the textbooks, asked my doctors anything I didn't understand and a few verification questions to make sure I was understanding the content properly. I basically know enough to be dangerous. Right around then I started getting a lot of ads for jobs in the medical field and I guess I tricked the algorithm into thinking I got a good one.

[-] DozensOfDonner@mander.xyz 2 points 1 year ago

Ah lol, that makes sense actually. I guess the algorithm isn't nuanced enough to pick out a case such as yours

[-] agent_flounder@lemmy.one 32 points 1 year ago

Hm I wonder if generative AI could somehow be used to create fake user behavior to obscure my real behavior.

[-] Nibodhika@lemmy.world 25 points 1 year ago

There already are tools out there that obfuscate your data by making it purposefully wrong, e.g. https://adnauseam.io/ is an ad blocker that also clicks every single ad sent your way.

[-] redballooon@lemm.ee 9 points 1 year ago

Also benefiting the ad companies who charge per click, while not ever leading to a sale.

[-] Dr_Cog@mander.xyz 6 points 1 year ago

It mostly benefits the website that is being visited. If an ad company sells ad spots that don't convert to sales, the organizations buying the ad spots will be less likely to buy them in the future, potentially hurting the ad company

[-] Kimano@lemmy.world 1 points 1 year ago

Yeah but this also results in the site receiving less money from ads, also hurting them in the long run. Really the solution is to pay for the services you enjoy, and acknowledge the age-old "if the service is free, you are the product".

[-] Nibodhika@lemmy.world 1 points 1 year ago

It has always been my understanding that sites and apps get paid X cents for displaying an ad, and Y cents (where Y is larger than X) whenever someone actually clicks an ad on their site. So this would in theory help the website.

Slight spoiler for Neal Stephenson's book "Fall (or: Dodge in Hell)".That idea is used early-ish in the book to muddy the waters and regain some privacy for a significant character. There's a subplot involving a conspiracy theory being intentionally spread through online message boards and this character becomes a target of harassment. By flooding the sites with a full spectrum of scandals and red herrings, they're able to drown out the pieces of legitimate personal info and confuse people enough that they move on to new topics.

[-] driving_crooner@lemmy.eco.br 23 points 1 year ago* (last edited 1 year ago)

I think we could run in the same effect as energy have, when a new source of energy is found, it's not reduce any energy use, it actually increases the use of it because is now cheaper.

With more data, more companies that use statistical analysis to processes it are going to pop up, new uses for that data is found and more data is going to be needed.

I work in insurance, with insurance bonds, nowadays the underwriters check not only the financial information of the company buying the policy, but also the personal and financial information of the executives, directors and their families. We didn't went so far 10 years ago because it was expensive, in the future, maybe we even add employees and their families on the underwriting assessment too.

[-] Ryantific_theory@lemmy.world 8 points 1 year ago

Yeah, I could see the financial value dropping, with businesses less willing to pay as much for harvested data, but I don't see a point in time where they don't attempt harvest every last piece of data on the off chance somebody wants it though. Advertisers paid insane amounts of money for targeted information, but even Google's seen a huge contraction in their advertising revenue.

Doesn't mean they aren't frantically trying to harvest data more aggressively (just recently tried to bake it into the internet itself), just that our data is getting cheaper.

[-] zcd@lemmy.ca 20 points 1 year ago* (last edited 1 year ago)

As AI generated word salads start polluting the Internet more and more, It will become kind of useless for training future AI models and advertising. It’s logical to think that confirmed human generated content would become more valuable perhaps.. Check out the dead Internet theory

[-] MajorHavoc@lemmy.world 7 points 1 year ago

The cool part is that we're definitely going to see people paying top dollar for certified human generated data, that happens to be generated by an AI that can beat Captcha better than humans can.

It's a tiny bright spot in this mess, but I'm really looking forward to laughing at someone for it.

[-] dustyData@lemmy.world 17 points 1 year ago* (last edited 1 year ago)

Data analysis, big data, etc. Already hit a point of data pollution concern. There are more datasets available than there are possible scientific and financial uses in a given timeframe. Basically, there's people whose job is to sit around all day and try to find uses with any ROI for data that companies already paid for or already collected without any particular goal in mind. Because if they don't, then those expenses will turn into losses.

Think of it this way, while companies would pay dearly for datasets for advertisement, for example, these include the data of several million people. While the individual data point you personally generated would amount to fractions of a cent in value. So, our data is already heavily devalued.

At the same time, the highest cost is in scrubbing, cleaning, tailoring and analyzing data. Turning data into actionable information. Collecting the data itself is only a small portion of costs. While the vast majority of data collected is low quality or straight up useless. To mitigate data pollution, there's a whole field of data science whose sole job is to decide what needs to be deleted and thrown away, for it harms the usability of the rest of the dataset.

Equally, companies pay millions in online advertisements, but the individual impression you see on a webpage or search results cost a few cents at most.

[-] Aceticon@lemmy.world 15 points 1 year ago* (last edited 1 year ago)

If the same data about you is collected several times by several different companies, each individual data item it will of course be worth less even if only because there are now multiple competing companies trying to sell that data item harvested from you.

You see, unlike with creative works for which governments created artificial monopolies (which is what IP legislation is all about), data about you doesn't have any such thing as Copyright limiting the copying of it (if it did, the copyright would naturally be yours, which would screw all those data harvesting businesses, which is why politicians would never ever make it so), so it acn be collected multiple times and copied at will unless in some kind of processed way.

If you think of it, it's not by chance that Google's collected private data isn't sold directly but rather advertising services are sold on top of it: if the data never leaves the private-data-harvesting company's system there won't be tons of copies floating around. What's sold is an ad service, rather than the data, so the collection of that data by other companies won't devalueate Google's profit from eit quite as much because they're not selling the data but a service on top of it and those other companies won't be able to sell quite the same service unless they reach Google' breadth of spying on people.

However everybody and their dog is following that same strategy so everybody and their dog are reaching levels of spying on people close enough to Google's that we still end up with enough competition on "ads over private data" and hence indirectly that lowers the value for that data.

[-] Asudox@lemmy.world 1 points 1 year ago

Great. Now I am going to let companies harvest data about me every second so one day they'll just let me be because I make them no profit.

[-] Aceticon@lemmy.world 2 points 1 year ago

Well, the great new unexploited field of data gathering is in toiled habits, so expect an explosion of smart-toilets that gather data on your bowel movemnts and send it to central servers so that you can be presented with personally tailored ads for food-supplements.

[-] Asudox@lemmy.world 2 points 1 year ago

What a wonderful world we live in.

[-] DogMuffins@discuss.tchncs.de 13 points 1 year ago

I think the sources of data will change. Things like browsing habits will become useless, but other tools you use will harvest very specific data.

[-] superduperenigma@lemmy.world 14 points 1 year ago

We're already there. There's a reason companies keep pushing this idea that every single goddamn thing in your household needs to be a "smart" device that's connected to an account.

[-] DogMuffins@discuss.tchncs.de 4 points 1 year ago

Well we've been on this trajectory for a long time but we're nowhere near "there".

Imagine a personal AI assistant. Not the bullshit voice assistants we have now, but an AI that is trained to assist me personally and predict my needs. Not only would it contain a wealth of information about me, but it could feed me promoted selections as well.

[-] NightAuthor@lemmy.world 8 points 1 year ago

Nice idea, as the data becomes worth less, there will be more incentive to collect more data and more invasive data.

[-] etchinghillside@reddthat.com 7 points 1 year ago

Vague hopes that it would drive advancements in compression methods and storage costs.

this post was submitted on 23 Sep 2023
163 points (98.8% liked)

No Stupid Questions

35806 readers
368 users here now

No such thing. Ask away!

!nostupidquestions is a community dedicated to being helpful and answering each others' questions on various topics.

The rules for posting and commenting, besides the rules defined here for lemmy.world, are as follows:

Rules (interactive)


Rule 1- All posts must be legitimate questions. All post titles must include a question.

All posts must be legitimate questions, and all post titles must include a question. Questions that are joke or trolling questions, memes, song lyrics as title, etc. are not allowed here. See Rule 6 for all exceptions.



Rule 2- Your question subject cannot be illegal or NSFW material.

Your question subject cannot be illegal or NSFW material. You will be warned first, banned second.



Rule 3- Do not seek mental, medical and professional help here.

Do not seek mental, medical and professional help here. Breaking this rule will not get you or your post removed, but it will put you at risk, and possibly in danger.



Rule 4- No self promotion or upvote-farming of any kind.

That's it.



Rule 5- No baiting or sealioning or promoting an agenda.

Questions which, instead of being of an innocuous nature, are specifically intended (based on reports and in the opinion of our crack moderation team) to bait users into ideological wars on charged political topics will be removed and the authors warned - or banned - depending on severity.



Rule 6- Regarding META posts and joke questions.

Provided it is about the community itself, you may post non-question posts using the [META] tag on your post title.

On fridays, you are allowed to post meme and troll questions, on the condition that it's in text format only, and conforms with our other rules. These posts MUST include the [NSQ Friday] tag in their title.

If you post a serious question on friday and are looking only for legitimate answers, then please include the [Serious] tag on your post. Irrelevant replies will then be removed by moderators.



Rule 7- You can't intentionally annoy, mock, or harass other members.

If you intentionally annoy, mock, harass, or discriminate against any individual member, you will be removed.

Likewise, if you are a member, sympathiser or a resemblant of a movement that is known to largely hate, mock, discriminate against, and/or want to take lives of a group of people, and you were provably vocal about your hate, then you will be banned on sight.



Rule 8- All comments should try to stay relevant to their parent content.



Rule 9- Reposts from other platforms are not allowed.

Let everyone have their own content.



Rule 10- Majority of bots aren't allowed to participate here.



Credits

Our breathtaking icon was bestowed upon us by @Cevilia!

The greatest banner of all time: by @TheOneWithTheHair!

founded 1 year ago
MODERATORS