I'd like some LLM features centered around "here is my organized folder of documents for context," but without paying $20/month, buying a $2,000 GPU, or handing all of my data to Google/Microsoft. I couldn't find anything that actually worked even when I did pay money or give my data to Google/Microsoft. Setting the folder thing aside, I remember asking Claude to summarize a novel I'm writing, and it kept mixing characters up, attributing details to one character that unambiguously applied only to others.
It doesn't work in the average case. I've seen this tactic at the company I work for and at multiple companies where I have contacts. Bosses think they can simply use "AI" to fix their hollowed-out documentation, onboarding, and employee-education systems by pushing a bunch of half-correct, barely legible "documentation" through an LLM.
It just spits out garbage for 90% of the people doing this. It's a garbage-in, garbage-out process. For it to even be useful, you need a specific kind of setup (retrieval-augmented generation, or RAG, layered on top of the LLM) and your documentation has to be high quality to begin with.
Here's an example project: https://github.com/snexus/llm-search
The demo works well because it uses a well-documented open-source library. Even then, there's no guarantee it won't hallucinate or get mixed up. RAG works simply by priming the generator with "context" retrieved for your query; if the model's weights pull strongly enough in another direction, your context won't outweigh the allure of statistical hallucination.
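To make the "priming" concrete, here's a minimal sketch of what the retrieval step looks like, using only the Python standard library. The chunking, the crude word-overlap scoring, and the `call_llm` stub are all my own placeholders (real systems like llm-search use embeddings and a vector store, not bag-of-words matching):

```python
import math
import re
from collections import Counter
from pathlib import Path

def chunk_folder(folder: str, chunk_words: int = 200):
    """Split every .md/.txt file under a folder into fixed-size word chunks."""
    chunks = []
    for path in Path(folder).rglob("*"):
        if path.suffix not in {".md", ".txt"}:
            continue
        words = path.read_text(errors="ignore").split()
        for i in range(0, len(words), chunk_words):
            chunks.append((str(path), " ".join(words[i:i + chunk_words])))
    return chunks

def score(query: str, text: str) -> float:
    """Crude bag-of-words overlap score; a stand-in for embedding similarity."""
    q = Counter(re.findall(r"\w+", query.lower()))
    t = Counter(re.findall(r"\w+", text.lower()))
    overlap = sum(min(q[w], t[w]) for w in q)
    return overlap / math.sqrt(len(t) + 1)

def build_prompt(query: str, chunks, k: int = 3):
    """Retrieve the top-k chunks and prepend them to the prompt as 'context'."""
    top = sorted(chunks, key=lambda c: score(query, c[1]), reverse=True)[:k]
    context = "\n\n".join(f"[{path}]\n{text}" for path, text in top)
    prompt = (
        "Answer using ONLY the context below. "
        "If the context does not contain the answer, say so.\n\n"
        f"{context}\n\nQuestion: {query}\nAnswer:"
    )
    return prompt, [path for path, _ in top]  # keep the source paths for a bibliography

# prompt, sources = build_prompt("How do I rotate the API key?", chunk_folder("./docs"))
# answer = call_llm(prompt)  # call_llm = whatever local model you run; not defined here
```

Note that nothing in this pipeline forces the generator to obey the context; the instruction "use ONLY the context" is just more text in the prompt, which is exactly why strong priors in the weights can still win out.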
I've seen one or two companies doing exactly this, but specific to certain varieties of bureaucracies (mostly insurance companies, unfortunately). It struck me as one of the few potentially real uses for LLMs, so long as the system provides a bibliography for each response that can be checked deterministically.
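The deterministic check is the easy half: if the system returns source paths alongside each claim (like the `sources` list in the sketch above), verifying a quoted span is just string matching, with no model in the loop. A toy illustration, assuming the convention that answers quote their sources verbatim (my assumption, not anything these vendors actually ship):

```python
from pathlib import Path

def verify_citation(quoted_span: str, source_path: str) -> bool:
    """Deterministic check: the quoted span must appear verbatim
    (whitespace-normalized, case-insensitive) in the cited source file."""
    def normalize(s: str) -> str:
        return " ".join(s.split()).lower()
    source_text = Path(source_path).read_text(errors="ignore")
    return normalize(quoted_span) in normalize(source_text)
```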