
Monday, April 7, 2025

# life: detecting hallucinations (in large language models) using semantic entropy.

<< Large language model (LLM) systems, such as ChatGPT or Gemini, can show impressive reasoning and question-answering capabilities but often ‘hallucinate’ false outputs and unsubstantiated answers. >>

<< Here (AA) develop new methods grounded in statistics, proposing entropy-based uncertainty estimators for LLMs to detect a subset of hallucinations — confabulations — which are arbitrary and incorrect generations. (Their) method addresses the fact that one idea can be expressed in many ways by computing uncertainty at the level of meaning rather than specific sequences of words. >>

Their method << works across datasets and tasks without a priori knowledge of the task, requires no task-specific data and robustly generalizes to new tasks not seen before. By detecting when a prompt is likely to produce a confabulation, (it) helps users understand when they must take extra care with LLMs and opens up new possibilities for using LLMs that are otherwise prevented by their unreliability. >>
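To make the idea concrete, here is a minimal count-based sketch of semantic entropy, not the authors' full pipeline: Farquhar et al. sample several answers to the same prompt, cluster them by meaning (using bidirectional entailment judged by a separate language model), and compute entropy over the meaning clusters rather than over word sequences. The `same_meaning` function below is a hypothetical stand-in for that entailment check, and cluster probabilities are simple sample frequencies.

```python
import math

def same_meaning(a: str, b: str) -> bool:
    # Hypothetical placeholder for the semantic-equivalence test
    # (the paper uses bidirectional entailment between answers).
    return a.strip().lower() == b.strip().lower()

def semantic_entropy(sampled_answers: list[str]) -> float:
    """Cluster sampled answers by meaning, then take the entropy over clusters."""
    clusters: list[list[str]] = []
    for ans in sampled_answers:
        for cluster in clusters:
            if same_meaning(ans, cluster[0]):
                cluster.append(ans)
                break
        else:  # no existing cluster matched: start a new one
            clusters.append([ans])
    n = len(sampled_answers)
    probs = [len(c) / n for c in clusters]       # frequency of each meaning
    return -sum(p * math.log(p) for p in probs)  # entropy in nats

# Five sampled answers, three distinct meanings -> relatively high entropy,
# which flags the prompt as a likely source of confabulation.
answers = ["Paris", "paris", "Paris", "Lyon", "Marseille"]
print(round(semantic_entropy(answers), 2))
```

The point of clustering first is that rephrasings of the same answer ("Paris" vs. "paris") do not inflate the uncertainty estimate; only genuinely different meanings do.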

Sebastian Farquhar, Jannik Kossen, et al. Detecting hallucinations in large language models using semantic entropy. Nature 630, 625–630. Jun 19, 2024.

Also: ai (artificial intell) (bot), entropy, in https://www.inkgmr.net/kwrds.html 

Keywords: life, artificial intelligence, LLMs, confabulations, uncertainty, hallucinations, entropy, semantic entropy

Thursday, March 20, 2025

# aibot: I think, therefore I hallucinate: minds, machines, and the art of being wrong.

<< This theoretical work examines 'hallucinations' in both human cognition and large language models, comparing how each system can produce perceptions or outputs that deviate from reality. Drawing on neuroscience and machine learning research, (AA) highlight the predictive processes that underlie human and artificial thought. >>

<< In humans, complex neural mechanisms interpret sensory information under uncertainty, sometimes filling in gaps and creating false perceptions. This inference occurs hierarchically: higher cortical levels send top-down predictions to lower-level regions, while mismatches (prediction errors) propagate upward to refine the model. LLMs, in contrast, rely on auto-regressive modeling of text and can generate erroneous statements in the absence of robust grounding. >>
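The hierarchical predictive-processing picture quoted above can be caricatured in a few lines of code. This is a deliberately toy, single-level loop of my own, not a model from the paper: a top-down prediction `mu` is compared against noisy input, and the bottom-up prediction error nudges the belief until the internal model matches the data.

```python
import numpy as np

rng = np.random.default_rng(0)
true_cause = 2.0      # hidden state generating the sensory input
mu = 0.0              # higher-level belief (top-down prediction)
learning_rate = 0.1   # how strongly prediction errors update the belief

for step in range(50):
    y = true_cause + rng.normal(scale=0.3)   # noisy sensory observation
    prediction_error = y - mu                # bottom-up error signal
    mu += learning_rate * prediction_error   # refine the internal model

print(f"final belief: {mu:.2f} (true cause: {true_cause})")
```

If observations are few, noisy, or systematically biased, the same update rule converges on a belief that deviates from the true cause, a loose analogy to the gap-filling and false perceptions described in the quoted passage.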

<< Despite these different foundations - biological versus computational - the similarities in their predictive architectures help explain why hallucinations occur. (AA) propose that the propensity to generate incorrect or confabulated responses may be an inherent feature of advanced intelligence. In both humans and AI, adaptive predictive processes aim to make sense of incomplete information and anticipate future states, fostering creativity and flexibility, but also introducing the risk of errors. (Their) analysis illuminates how factors such as feedback, grounding, and error correction affect the likelihood of 'being wrong' in each system. (AA) suggest that mitigating AI hallucinations (e.g., through improved training, post-processing, or knowledge-grounding methods) may also shed light on human cognitive processes, revealing how error-prone predictions can be harnessed for innovation without compromising reliability. By exploring these converging and divergent mechanisms, the paper underscores the broader implications for advancing both AI reliability and scientific understanding of human thought. >>

Sebastian Barros. I Think, Therefore I Hallucinate: Minds, Machines, and the Art of Being Wrong. arXiv: 2503.05806v1 [q-bio.NC]. Mar 4, 2025.

Also: brain, curiosity, novelty, uncertainty, error, mistake, jazz, ai (artificial intell), in https://www.inkgmr.net/kwrds.html 

Keywords: brain, cognition, perceptions, curiosity, novelty, hallucinations, errors, prediction, prediction errors, error-prone predictions, AI, artificial intelligence, LLMs