#anthropic

5 posts · 5 participants · 0 posts today

No, #AI frontier models don't "just guess words"; it's far more complicated than that.

#Anthropic built an #LLM "brain scanner" (until now, AIs have largely been black boxes).

According to Anthropic, "it currently takes a few hours of human effort to understand the circuits we see, even on prompts with only tens of words." And the research doesn't explain how the structures inside LLMs are formed in the first place.

#OpenAI, #Anthropic, and other #LLM model vendors are starting to look a lot like #Docker - a ubiquitous technology with no real moat and no way to avoid becoming a commodity with razor-thin profit margins.
These companies will have a hard time competing with small, end-user-focused competitors that provide nicely packaged #AI-based apps for specific users and use cases.

Who is *really* using the newest AI models?

Anthropic recently launched the Anthropic Economic Index, "aimed at understanding AI's effects on labor markets and the economy over time." In their latest post, they explore who is making the most use of their new extended thinking models.

The biggest users of the new models? Computer research scientists, software developers, multimedia artists, and video game designers!

Find out more here:

anthropic.com/news/anthropic-e
#AI #GenAI #Anthropic #GenerativeAI

#Anthropic has just published studies revealing how its #Claude model actually "thinks".

The researchers found that the #IA #AI plans its responses in advance, thinks in a #universal conceptual #language, and can sometimes even give explanations that do not reflect its true internal process.

lesnumeriques.com/intelligence

Les Numériques · How does an AI really work? Anthropic's researchers finally have the beginning of an answer · By Sofian Nouira

"Why do language models sometimes hallucinate—that is, make up information? At a basic level, language model training incentivizes hallucination: models are always supposed to give a guess for the next word. Viewed this way, the major challenge is how to get models to not hallucinate. Models like Claude have relatively successful (though imperfect) anti-hallucination training; they will often refuse to answer a question if they don’t know the answer, rather than speculate. We wanted to understand how this works.

It turns out that, in Claude, refusal to answer is the default behavior: we find a circuit that is "on" by default and that causes the model to state that it has insufficient information to answer any given question. However, when the model is asked about something it knows well—say, the basketball player Michael Jordan—a competing feature representing "known entities" activates and inhibits this default circuit (see also this recent paper for related findings). This allows Claude to answer the question when it knows the answer. In contrast, when asked about an unknown entity ("Michael Batkin"), it declines to answer.

Sometimes, this sort of “misfire” of the “known answer” circuit happens naturally, without us intervening, resulting in a hallucination. In our paper, we show that such misfires can occur when Claude recognizes a name but doesn't know anything else about that person. In cases like this, the “known entity” feature might still activate, and then suppress the default "don't know" feature—in this case incorrectly. Once the model has decided that it needs to answer the question, it proceeds to confabulate: to generate a plausible—but unfortunately untrue—response."

anthropic.com/research/tracing
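
The excerpt above describes a concrete mechanism: a refusal circuit that is on by default, a competing "known entity" feature that can inhibit it, and a failure mode where mere name familiarity suppresses refusal even though no facts are attached to the name. As a purely illustrative sketch (not Anthropic's actual circuitry; the feature names, flags, and thresholds below are invented for exposition), the dynamic could be caricatured in Python like this:

```python
# Toy illustration of the circuit described in the quoted post: a default
# "can't answer" feature that a "known entity" feature can inhibit.
# Everything here is invented for exposition; it is not Anthropic's model.

from dataclasses import dataclass


@dataclass
class EntityMemory:
    name_is_familiar: bool   # the model has seen the name before
    facts_available: bool    # the model actually knows facts about the entity


def answer_or_refuse(entity: str, memory: EntityMemory) -> str:
    # Default circuit: refusal is "on" unless something suppresses it.
    refusal_active = True

    # Hypothetical "known entity" feature: in the misfire case it activates
    # on mere name familiarity, even when no facts are attached to the name.
    known_entity_active = memory.name_is_familiar

    if known_entity_active:
        # The known-entity feature inhibits the default refusal circuit.
        refusal_active = False

    if refusal_active:
        return f"I don't have enough information about {entity}."
    if memory.facts_available:
        return f"Here is what I know about {entity}: ..."
    # Refusal was suppressed but no facts exist: the model proceeds anyway
    # and produces a plausible but untrue answer (a hallucination).
    return f"{entity} is, as far as I recall, ... (confabulated)"


print(answer_or_refuse("Michael Jordan", EntityMemory(True, True)))    # answers
print(answer_or_refuse("Michael Batkin", EntityMemory(False, False)))  # refuses
print(answer_or_refuse("A familiar name", EntityMemory(True, False)))  # hallucinates
```

The third call models the "misfire" case from the post: the name is familiar enough to switch off the default refusal, but with no facts attached, the only remaining path is a confabulated answer.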