#llm Timeline

128d

The DAIR Institute makes sceptical videos warning about the dangerous hype and irresponsible practices currently driving AI, LLMs and related tech. You can follow at:

➡️ @dair

There are already over 70 videos uploaded. If these haven't federated to your server yet, you can browse them all at https://peertube.dair-institute.org/a/dair/videos

You can also follow DAIR's general social media account at @DAIR@dair-community.social

#FeaturedPeerTube #DAIR #AI #LLM #LLMs #OpenAI #SamAltman #Sceptic #Skeptic #PeerTube #PeerTubers

0 0 1 View Post & Replies See Original

71d

Rendez-vous au colloque annuel de l'Association francophone des humanités numériques #Humanistica (22-25 avr. 2025 à #Dakar) pour découvrir les communications de l'équipe du Lab d'Huma-Num (CNRS). Un poster et une communication de Adam Faci et Léa Maronet, membres du HN Lab, intitulée « SegmentArt, une méthode plus rapide pour annoter des images en utilisant SegmentAnything2 ». Présentation des contenus sur https://hnlab.huma-num.fr/blog/tags/humanistica-2025/ #IA #veilleESR #CNRS #ISIDORE #IST #découvrabilité #LLM

Edited 66d ago

0 0 0 View Post & Replies See Original

145d

What if "42" is just the hallucination of an LLM from the future? 🤔

#LLM #LLMs #AI #generativeAI #42

Edited 145d ago

0 0 0 View Post & Replies See Original

52d

So if you thought that surveillance based ads couldn't get any worse... Meta: Hold my beer!

#Facebook #Meta #surveillance #AI #LLM #Ads

https://techcrunch.com/2025/05/07/mark-zuckerbergs-ai-ad-tool-sounds-like-a-social-media-nightmare/?utm_campaign=social&utm_source=threads&utm_medium=organic

0 0 1 View Post & Replies See Original

1y

https://aistudio.google.com/
#Google AI Studio是我最强大的AI工具，使用的 #LLM 当然是 #Gemini ，可以在1.0Pro、1.5Pro和1.5flash中选择（Gemini网页和app上的1.5付费用户专享，这里免费不限量），1.5支持1M tokens（2M版本排队中），温度、安全性等参数可调，完全免费，唯一的问题是没有移动页面。
AI Studio是为开发设计的，比面向用户的聊天UI功能丰富得多，当然也复杂一些。
作为用户，这个开发UI有什么用呢？
首先，你能整本书扔给他，甚至一次几本书也行。梳理情节、分析人物、搜寻段落，效果很不错，就像一个伴读。幻觉仍有，人物经历的前半段通常没问题，结局经常出错，输出太长还是容易胡言乱语。这是处理长文本最好的工具，图片、音频、视频也都支持。
还有个结构化提示功能，可以设置多组输入-输出对作为示例，每组中的输入、输出都可以有多个，输入也可以是图片。Gemini会学习这些示例来处理类似问题。

Edited 1y ago

1 0 0 View Post & Replies See Original

1y

我目前能体验到的还仅仅是文本模式，就已经体会到了 #GPT-4o 的强大。
请依次计算表面积为1的球体、正方体、正四面体的体积，结果用小数表示。
https://chatgpt.com/share/4a746518-63fd-4f39-80b4-ce27820c810b
#4o 的答案简洁干脆，毫无瑕疵。
这个题目就是套公式，毫无难度，但是4o之前没有一个能算对的，除了 #Wolfram Alpha，而它不算 #LLM 。
我一直相信 #数学能力是LLMs最重要的能力之一，因为数学的本质是 #逻辑。 #知识是否丰富其实并不重要，#RAG 可以很好地补充。
这就像大家都想成为知识渊博的聪明人，但是如果只能二选一的话，无知的聪明人总比什么都知道的傻子要好太多，因为知识是容易学习的，而逻辑就难得多。

Edited 1y ago

1 0 0 View Post & Replies See Original

1y

陪孩子看 #三国，看到 #赤壁之战前， #诸葛亮背《 #铜雀台赋》激周瑜抗曹。“揽 #二乔于东南兮，乐朝夕之与共”这句显然有问题。 #曹植一代文豪，写赋公开称颂其父欲夺人之妻，实在匪夷所思。这句本为“连二桥于东西兮，若长空之蝃𬟽”，《三国志》裴松之注所录连“二桥”都没有。
这一点一眼就能看出是小说家言，另一点却需要一点考证。《铜雀台赋》作于铜雀台落成之时，是在赤壁之战两年以后。
赤壁之战前， #曹操真的说过要抢二乔吗？一个比较完善有理的回答大概是这样：
简单结论：这是民间传言，并无实据。然后解释为什么会有这个传言：一是《三国演义》这段的影响，二是杜牧诗“东风不与周郎便，铜雀春深锁二乔”，三是曹操形象不好，还确实喜欢人妻。
这个逻辑并不复杂，#LLM s能理解吗？

Edited 1y ago

1 0 0 View Post & Replies See Original

1y

2022年11月 #ChatGPT 发布，迅速火出圈。IT界几乎每家公司都号称自己也有这个技术，但大概半年以后，模仿者才逐渐上线，并且至今还在紧紧追赶。
有几个问题：
1、为什么ChatGPT之前没人说，之后大家都说有？
2、如果本来就有，为什么过了半年才拿出产品？
我想之前确实很多公司都有 #LLM 技术，但都是作为一个 #NLP 工具，没人相信能达到这个效果，所以没有投入很多资源。一旦知道这条路能走通，就开始争先恐后地买卡、训练了。
但他们还是低估了从技术到产品这个过程的复杂程度，所以花了很长时间才能面世，而且也一直追不上ChatGPT 。

Edited 1y ago

0 0 0 View Post & Replies See Original

306d

Postgres as a search engine
https://anyblockers.com/posts/postgres-as-a-search-engine
https://news.ycombinator.com/item?id=41343814

Build a retrieval system with semantic, full-text, & fuzzy search in Postgres to be used as a backbone in RAG pipelines.

We’ll combine 3 techniques:

* full-text search with tsvector
* semantic search with pgvector
* fuzzy matching with pg_trgm

* bonus: BM25

https://en.wikipedia.org/wiki/Okapi_BM25
https://blog.paradedb.com/pages/elasticsearch_vs_postgres
https://news.ycombinator.com/item?id=41173288

#RDBMS #postgres #PostgreSQL #databases #search #SearchEngine #LLM #RAG #BM25

Edited 306d ago

0 0 1 View Post & Replies See Original

38d

Programming properly should be regarded as an activity by which the programmers form or achieve a certain kind of insight, a theory, of the matters at hand. This suggestion is in contrast to what appears to be a more common notion, that programming should be regarded as a production of a program and certain other texts.

Peter Naur in Programming As Theory Building, 1985.

A computer program is not source code. It is the combination of source code, related documents, and the mental understanding developed by the people who work with the code and documents regularly. In other words a computer program is a relational structure that necessarily includes human beings.

The output of a generative AI model alone cannot be a computer program in this sense no matter how closely that output resembles the source code part of some future possible computer program. That the output could be developed into a computer program over time, given the appropriate resources to do so, does not make it equivalent to a computer program.

#AI #GenAI #GenerativeAI #LLM #Copilot #AgenticCoding #dev #tech #SoftwareDevelopment #SoftwareEngineering #programming #coding

Edited 38d ago

1 0 0 View Post & Replies See Original

327d

TECH BROs v. THE OCEAN:

CRYPTO: let's boil the oceans to create fake money for criminals. We'll fleece the rubes and make miillions. LOL you own that JPG now. Sure you do.

LLMs: let's boil the oceans to create pure garbage out of people's intellectual property. We'll steal from everyone and make millions.

THE OCEANS: brb making some hurricanes

#crypto #llm #ai #techbros #climate #idiots

Edited 327d ago

0 0 0 View Post & Replies See Original

124d

I have what I think is a good example of how useless ‘AI’ is for understanding. I am tagging widely. I searched “how to identify mushrooms” on DuckDuckGo, which then so helpfully spammed my screen with this lovely advice (see image with alt text). The source of much of my knowledge is mushroomexpert.com, managed by Michael Kuo.

“A mushroom is identified by its characteristics”. I could get semantic here too about the definition of a mushroom, but talk about a pretty useless statement. Fine though. That’s well enough and good if you want an explanation that is super entry level. That’s not necessarily a bad thing, though I don’t remember telling the ‘AI’ that I wanted only entry level information.

Then it talks about the danger in attempting to ID mushrooms because of the potential for poisoning. It tacitly assumes that my wanting to ID a mushroom means I want to eat it. I don’t. I just like mushrooms. I have a problem with the whole ‘some are poisonous’ throw-in, like its something their lawyers required them to include. How many are poisonous? 90%? 5%? We have no idea, and that’s OK. I didn’t tell the ‘AI’ that I wanted information on whether or not they were poisonous. But, as I’ll get to, the fact that this is included is not my problem. My problem is what they don’t include.

I think mushrooms are awesome. I think the fact that some of them are poisonous is relevant only based on the human-centric assumptions ‘AI’ is so obsessed with and what it’s dataset is built on. I don’t see the value in a mushroom based on whether or not I can eat it, and it chaffs me that they don’t also include any information about their ecological roles. You know what is a great way to identify a mushroom (including if I want to eat it)?!?!?! Their ecology (essentially, their ‘behavior’)!!! Let’s be sure to not mention that, #TechBros.

Ok let’s keep going, cause we’ve made it this far. It suggests talking to a #mycologist. It turns out that I don’t have any experienced mycologists on call. Mycologists are helpful but busy people. And I’m more likely than most of the population to know mycologists. You might as well say, ‘don’t bother trying to ID the mushroom’. Way to kill my interest immediately in something I’m trying to get into. If you really want to learn to ID mushrooms for foraging, there are sources you can look up to help you.

I’ll get to my main point. Identification of certain mushroom forming fungi to species is essentially impossible. Look up Amanitas or Russulas on mushroomexpert.com (phenomenal source, old school blogging). There is no clear delineating of what a mushroom forming species even is. Scientists argue over and reclassify bird subspecies all the time. Imagine the black box that is mushroom forming fungi, which most of the time is a web of single-cell wide threads hidden in the soil. Some mushrooms historically were ‘IDed’ (scientifically) by taste or color, which as you all know everyone experiences these things the same, all the time. And, darnit, I happened to leave my DNA sequencing kit at home (as if there aren’t issues with classifying mushroom forming fungi on their DNA alone).

If ‘AI’ were functional, to me, it would include the suggestion that one option is, instead of focusing on species, focus on species groupings (this also applies to foraging for mushrooms if done thoughtfully). Species groupings can be more useful, as is sometimes saying: “I don’t need to know exactly what this is. I’ll just focus on it’s ecology instead of obsessing over an arbitrary definition”. This nuance is not something that can be corrected with better algorithms or more training data (in fact, its going to get worse), because #LLM s are designed to spit out the lowest common denominator.

In the end, given all the questions I brought up, the biggest problems I have with ‘AI’ is that it falsely assumes something gigantic about the question I am asking and gives a simplified and highly misleading perception of how much we actually know. I think it makes a big mistake assuming that I am uncurious and want a bare-minimum answer. And when it comes to the grand total of all there is to know about mushroom forming fungi, we know next to nothing. Of course, 'AI' cannot say that because 'AI' doesn't know what it doesn't know.

You know who can identify and communicate all of these nuances? Humans.

#nature #mushrooms #fungi #AI #technology #artificialIntelligence #ecology #solarPunk #EcologicalReciprocity

AI answer: to identify a mushroom, observe its physical characteristics such as cap shape, color, and gill structure, and take a spore print to determine its spore color. Its important to consult a reliable field guide, and if possible, seek guidance from an experienced mycologist, as many mushrooms can look similar and some are poisonous.

ALT

0 0 0 View Post & Replies See Original

152d

DeepSeek launched a free, open-source large-language model in late December, claiming it was developed in just two months at a cost of under $6 million — a much smaller expense than the one called for by Western counterparts.

These developments have stoked concerns about the amount of money big tech companies have been investing in AI models and data centers, and raised alarm that the U.S. is not leading the sector as much as previously believed.

The "Western counterparts" are claiming training a model might take years and billions of dollars. This has always been a hyped-up grift, with snake oil salesmen and con artists being showered with money and power. It's really quite amazing how profoundly unintelligent "the market" is in practice.

The sad reality is that the US could lead in this field (1), if we'd stop routinely putting narcissists and con artists in charge and showering them with praise even when they fail.

From https://www.cnbc.com/2025/01/27/nvidia-falls-10percent-in-premarket-trading-as-chinas-deepseek-triggers-global-tech-sell-off.html

#AI #GenAI #GenerativeAI #LLM #SnakeOil #hype #grift #MarketCapitalism

(1) Putting aside whether we should, which is an important question.

1 0 0 View Post & Replies See Original

158d

Sabot in the Age of AI

Here is a curated list of strategies, offensive methods, and tactics for (algorithmic) sabotage, disruption, and deliberate poisoning.

🔻 iocaine
The deadliest AI poison—iocaine generates garbage rather than slowing crawlers.
🔗 https://git.madhouse-project.org/algernon/iocaine

🔻 Nepenthes
A tarpit designed to catch web crawlers, especially those scraping for LLMs. It devours anything that gets too close. @aaron
🔗 https://zadzmo.org/code/nepenthes/

🔻 Quixotic
Feeds fake content to bots and robots.txt-ignoring #LLM scrapers. @marcusb
🔗 https://marcusb.org/hacks/quixotic.html

🔻 Poison the WeLLMs
A reverse-proxy that serves diassociated-press style reimaginings of your upstream pages, poisoning any LLMs that scrape your content. @mike
🔗 https://codeberg.org/MikeCoats/poison-the-wellms

🔻 Django-llm-poison
A django app that poisons content when served to #AI bots. @Fingel
🔗 https://github.com/Fingel/django-llm-poison

🔻 KonterfAI
A model poisoner that generates nonsense content to degenerate LLMs.
🔗 https://codeberg.org/konterfai/konterfai

iocaine

> The deadliest poison known to AI.

This is a tarpit, modeled after Nepenthes, intended to catch unwelcome web crawlers, but with a slightly different, more aggressive intended usage scenario. The core idea is to configure a reverse proxy to serve content generated by iocaine to AI crawlers, but normal content to every other visitor. This differs from Nepenthes, where the idea is to link to it, and trap crawlers that way. Not with iocaine, where the trap is laid by the reverse proxy.

iocaine does not try to slow crawlers. It does not try to waste their time that way - that is left up to the reverse proxy. iocaine is purely about generating garbage.

ALT

0 0 2 View Post & Replies See Original

150d

OpenAI 称有证据 DeepSeek 使用「模型蒸馏」技术利用其模型进行训练。

- The Verge 的文章还提到，考虑到是 OpenAI 开盗用互联网数据训练其模型之先河，这一指控颇具有讽刺性。
- 404 Media 的讽刺则更为直接："Hahahahahahahahahahahahahahahaha hahahhahahahahahahahahahahaha"。

theverge.com/~
404media.co/~
seealso: HackerNews:42865527

#LLM #OpenAI #DeepSeek #today

Telegram 原文

0 0 1 View Post & Replies See Original

22d

Did you know? #Fedify provides #documentation optimized for LLMs through the llms.txt standard.

Available endpoints:

https://fedify.dev/llms.txt — Core documentation overview
https://fedify.dev/llms-full.txt — Complete documentation dump

Useful for training #AI assistants on #ActivityPub/#fediverse development, building documentation chatbots, or #LLM-powered dev tools.

#fedidev

0 0 1 View Post & Replies See Original

147d

If you turn the sink off when you're done using it to conserve water, or turn the lights off when you leave the room to conserve electricity, why do you use ChatGPT or other AI tools? Using those sorts of tools a few times a month negates whatever you've conserved by being prudent in other parts of your life.

#AI #GenAI #GenerativeAI #LLM #waste #environment

0 0 0 View Post & Replies See Original

143d

Resistance to the coup is the defense of the human against the digital and the democratic against the oligarchic.

From https://snyder.substack.com/p/of-course-its-a-coup

Defense of the human against the digital has been my mission for some time. Resisting the narratives about how #LLMs "reason", "pass the Turing test", "diagnose illnesses", are "better than humans" in various ways are part of it. Resisting the false narrative that we're on the verge of discovering #AGI is part of it. Allowing these false stories to persist and spread means succumbing to very dark anti-human forces. We're seeing some of the consequences now, and we're seeing how far this might go.

#USPol #AI #GenAI #GenerativeAI #LLM #AGI

0 0 0 View Post & Replies See Original

22d

yo fedi, how are you today?

I'm trying to find the absolute best, well-argumented (!) #LLM takedown blogpost you've read lately.. preferably related to software development.

Unfortunately my boss is on the hype-train (random glory posts in slack) and I'd like to counter him just a wee bit. Bring it on 😁

Boosting appreciated for reach

1 0 1 View Post & Replies See Original

22d

On the limits of LLMs (Large Language models) and LRMs (Large Reasoning Models). The TL;DR: "Our findings reveal fundamental limitations in current models: despite sophisticated self-reflection mechanisms, these models fail to develop generalizable reasoning capabilities beyond certain complexity thresholds." Meaning: accuracy collapse.

Interesting paper from Apple. https://ml-site.cdn-apple.com/papers/the-illusion-of-thinking.pdf

#AI #LLM #LRM

• We question the current evaluation paradigm of LRMs on established math benchmarks and
design a controlled experimental testbed by leveraging algorithmic puzzle environments that enable
controllable experimentation with respect to problem complexity.
• We show that state-of-the-art LRMs (e.g., o3-mini, DeepSeek-R1, Claude-3.7-Sonnet-Thinking)
still fail to develop generalizable problem-solving capabilities, with accuracy ultimately collapsing
to zero beyond certain complexities across different environments.
• We find that there exists a scaling limit in the LRMs’ reasoning effort with respect to problem
complexity, evidenced by the counterintuitive decreasing trend in the thinking tokens after a
complexity point.
• We question the current evaluation paradigm based on final accuracy and extend our evaluation
to intermediate solutions of thinking traces with the help of deterministic puzzle simulators. Our
analysis reveals that as problem complexity increases, correct solutions systematically emerge at
later positions in thinking compared to incorrect ones, providing quantitative insights into the
self-correction mechanisms within LRMs.
• We uncover surprising limitations in LRMs’ ability to perform exact computation, including their
failure to benefit from explicit algorithms and their inconsistent reasoning across puzzle types.

ALT

Edited 21d ago

0 0 1 View Post & Replies See Original

17d

Senators Demand Transparency on Canceled Veterans Affairs Contracts
—

Following a ProPublica investigation into how DOGE had developed an error-prone AI tool to determine which VA contracts should be killed, a trio of lawmakers said the Trump administration continues to “stonewall” their requests for details.

https://www.propublica.org/article/doge-ai-veterans-affairs-canceled-contracts-senators-trump?utm_source=mastodon&utm_medium=social&utm_campaign=mastodon-post

#News #DOGE #Veterans #VA #AI #ArtificialIntelligence #LLM #Technology #Government

1 0 0 View Post & Replies See Original

12d

Regarding the last couple boosts: among other downsides, LLMs encourage people to take long-term risks for perceived, but not always actual, short-term gains. They bet the long-term value of their education on a chance at short-term grade inflation, or they bet the long-term security and maintainability of their software codebase on a chance at short-term productivity gains. My read is that more and more data is suggesting that these are bad bets for most people.

In that respect they're very much like gambling. The messianic fantasies some ChatGPT users have been experiencing fits this picture as well.

#AI #GenAI #GenerativeAI #LLM #tech #dev #ChatGPT #GPT #Gemini #GamblingAddiction #nihilism

Edited 12d ago

0 0 0 View Post & Replies See Original

12d

One definition of the word "artifice" is: crafty device; an artful, ingenious, or elaborate trick. One would be fully justified interpreting the phrase "artificial intelligence" as an elaborate trick resembling intelligence.

#AI #GenAI #GenerativeAI #LLM

0 0 0 View Post & Replies See Original

101d

Got a website?

Feel like helping make unauthorized LLM scrapers choke on an infinite sea of garbage, potentially making their models collapse?

...Then take a look at:
https://zadzmo.org/code/nepenthes/

PS Do make sure to read the warnings, boost and have fun! 😈

.

Thanks to @dlatchx for reminding me where to find this!

#AI #LLM #Nepenthes #LLMPoison #GenAI #Markov

Edited 101d ago

0 0 0 View Post & Replies See Original

99d

Several of my papers are in that LibGen database Meta used.

I feel a bunch of ways about it, but one way I feel is that it adds insult to injury. In all but two cases I was required to sign an onerous agreement to get the paper published, handing over rights to a publisher that is continuing to abuse this arrangement (in my view). I did that begrudgingly because I was early in my career and didn't think I had another option. Later I experimented with refusing to sign these agreements and publishers walked back the terms somewhat (I don't know if that's possible now).

I also feel that the Meta computer scientists responsible for this betrayed their own colleagues, which I find pretty scummy.

Anyway, I don't consent to any of this. It's been imposed on me and countless other authors.

#LibGen #meta #LLM #AI #GenAI #GenerativeAI

0 0 0 View Post & Replies See Original