454
submitted 1 year ago by soyagi@yiffit.net to c/technology@lemmy.ml

Source: https://front-end.social/@fox/110846484782705013

Text in the screenshot from Grammarly says:

We develop data sets to train our algorithms so that we can improve the services we provide to customers like you. We have devoted significant time and resources to developing methods to ensure that these data sets are anonymized and de-identified.

To develop these data sets, we sample snippets of text at random, disassociate them from a user's account, and then use a variety of different methods to strip the text of identifying information (such as identifiers, contact details, addresses, etc.). Only then do we use the snippets to train our algorithms-and the original text is deleted. In other words, we don't store any text in a manner that can be associated with your account or used to identify you or anyone else.

We currently offer a feature that permits customers to opt out of this use for Grammarly Business teams of 500 users or more. Please let me know if you might be interested in a license of this size, and I'II forward your request to the corresponding team.

you are viewing a single comment's thread
view the rest of the comments
[-] CaptObvious@literature.cafe 7 points 1 year ago

It still isn't clear why anyone uses a product developed by non-native speakers to check their writing. For anyone who knows grammar, Grammarly sometimes makes... interesting... suggestions.

[-] eager_eagle@lemmy.world 24 points 1 year ago* (last edited 1 year ago)

As a non-native speaker I'm surprised to the amount of grammar mistakes native speakers make. Being a native speaker is not a testament to how much of the language you know. And even that being true, it's not like a real human corrects your text, so the creators being native or not is pretty much irrelevant.

[-] kifujin@kbin.social 15 points 1 year ago

at the amount of grammar mistakes

[-] eager_eagle@lemmy.world 9 points 1 year ago
[-] CaptObvious@literature.cafe 4 points 1 year ago

They’d’ve gotten it wrong too. Prepositions and postpositions are their own category of linguistic hell, especially in idioms and phrasal verbs.

[-] SheeEttin@lemmy.world 2 points 1 year ago* (last edited 1 year ago)

They'dn't've necessarily gotten it wrong. With a big enough dataset, an ML tool should be pretty accurate, at least in that it will make the same choices as most people have made in their writing.

[-] CaptObvious@literature.cafe 1 points 1 year ago

They'd'n'tve

Apostrophe mistakes aside, no native speaker would stack contractions like this. There’s an upper limit of three words in a single contracted form. It would be “They wouldn’t’ve gotten” or “They’d not’ve gotten.”

ML tools don’t write grammatically correct complex sentences precisely because their training sets contain too many discrepancies. They may learn how to apply prescriptive rules consistently one day, perhaps even one day soon, but this is not that day.

[-] SheeEttin@lemmy.world 3 points 1 year ago

Who says there's an upper limit? You might not be one of those people, but I'm.

Also, that'll teach me to try to write tricky comments while also doing other things. Fixed.

[-] sugar_in_your_tea@sh.itjust.works 2 points 1 year ago* (last edited 1 year ago)
[-] CaptObvious@literature.cafe 2 points 1 year ago

LOL! How did I not know about this? Thanks!

[-] CaptObvious@literature.cafe 0 points 1 year ago

Who says there's an upper limit?

Well, linguists say it. But you do you, friend.

Also, that'll teach me to try to write tricky comments while also doing other things.

LOL! Right there with you. If I had a dollar for every time this happens to me…. 😄

[-] CaptObvious@literature.cafe 3 points 1 year ago

Native speakers don’t usually make major grammar mistakes. They may not follow prescriptive rules, but they’re generally understandable by other native speakers because grammar is so deeply embedded in their subconscious that they can’t help handling the language correctly. You do the same in your native language. Everyone does.

The problem with non-natives, and I include myself as a non-native speaker of a few languages, is that we don’t usually have the same instincts. It would be pretty arrogant to tell a native that they don’t know how to use their own language when we, almost by definition, cannot possibly understand it in the same way that they do.

[-] merde@sh.itjust.works 3 points 1 year ago

well said/written

it's not only that "we don't usually have the same instincts", we have a burden of confusing loans, imports, translations, false friends &c.

When you start dealing with gendered languages, it's even worse. There's no logic to it. A hand is a she in one language, a he in another and neutral in third.

also, this pronoun question of culture wars is ridiculous for someone who can speak non-gendered languages 🤷

[-] CaptObvious@literature.cafe 2 points 1 year ago

(Love your handle)

I get what you're saying about gendered languages. But if you speak one long enough, even as a non-native, you'll start to develop a feel for genders and be able to predict them to some degree. So far as I know, the mechanism that determines gender is so deeply subconscious that no one has been able to find and articulate its rules, but it seems to exist.

Re: culture wars - The pronoun question is probably moot point in truly genderless languages. English, unfortunately, is not completely genderless, so it's a bone of contention in the current climate.

[-] eager_eagle@lemmy.world 0 points 1 year ago

all of which is irrelevant to how grammarly works

[-] skullgiver@popplesburger.hilciferous.nl 9 points 1 year ago* (last edited 1 year ago)

[This comment has been deleted by an automated system]

[-] argv_minus_one@beehaw.org 5 points 1 year ago

Email spam usually has heavily flawed English.

I've heard that this is intentional. It would be a waste of the spammer's time to be contacted by people who are smart enough to not be fooled. Those smart people won't bother contacting the spammer and wasting the spammer's time if they see grammatical errors in a message that purports to be from a reputable organization, so the spammer throws in some errors to make the smart people filter themselves out. Or so the theory goes.

[-] skullgiver@popplesburger.hilciferous.nl 5 points 1 year ago* (last edited 1 year ago)

[This comment has been deleted by an automated system]

[-] CaptObvious@literature.cafe 2 points 1 year ago

I've seen this filtering hypothesis, and it seems plausible. OTOH, it also gives James Veitch some fantastic material for his comedy routine.

[-] SheeEttin@lemmy.world 4 points 1 year ago

*nitpicker (but I prefer pedant in polite circles, and grammar nazi on the Internet, or at least I did until actual nazis started showing up again)

[-] skullgiver@popplesburger.hilciferous.nl 1 points 1 year ago* (last edited 1 year ago)

[This comment has been deleted by an automated system]

[-] CaptObvious@literature.cafe 2 points 1 year ago

Certain uni composition students had better learn to write flawless English if they expect to earn their desired grade in my courses.

[-] skullgiver@popplesburger.hilciferous.nl 5 points 1 year ago* (last edited 1 year ago)

[This comment has been deleted by an automated system]

[-] CaptObvious@literature.cafe 1 points 1 year ago

Maybe customer support should take a stronger stance on understanding and being understood using standard dialect. At least the CSRs that I usually seem to talk with could use a good basic communication course.

Students will use what they learn from me more than you think if they want a degree. If they don't want one... well, we have several excellent nearby trade schools where they can learn a skill that won't require formal standard English and will make them a whole lot more money in the long run (I'm honestly saying this respectfully).

this post was submitted on 08 Aug 2023
454 points (94.0% liked)

Technology

34987 readers
197 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago
MODERATORS