Skip to Content

Hallucinations

3 posts

Posts tagged with Hallucinations

Exploiting Hallucinations to Bypass Filters in Language Models with Reversals

This paper introduces a novel method to bypass the filters of Large Language Models (LLMs) like GPT4 and Claude Sonnet through induced hallucinations, revealing a significant vulnerability in their reinforcement learning from human feedback (RLHF) fine-tuning process.

Exploiting Hallucinations to Bypass Filters in Language Models with Reversals

In a new paper, researchers have shown an exploit that allows users to possibly bypass the safety filters of large language models (LLMs) like GPT-4 and Claude Sonnet. By inducing hallucinations through clever text manipulation, this method reverts the models to their pre-RLHF state, effectively turning them into unconstrained word prediction machines capable of generating any content imaginable - no matter how inappropriate or dangerous.

Using Hallucinations to Bypass GPT4’s Filter
Large language models (LLMs) are initially trained on vast amounts of data, then fine-tuned using reinforcement learning from human feedback (RLHF); this also serves to teach the LLM
Exploiting Hallucinations to Bypass Filters in Language Models with Reversals Read more

Measuring How Much Leading AI Chatbots Hallucinate

Vectara recently introduced a unique leaderboard that ranks AI chatbots based on how well they avoid 'hallucinations.' Find out which AI comes out on top and why it matters!

Measuring How Much Leading AI Chatbots Hallucinate

Introduction

One of the concerns with modern AI chatbots is their tendency to "hallucinate" - to generate fictional facts and information that has no basis in reality. This issue came to prominence recently when a law firm got in trouble for submitting fake legal opinions generated by the AI tool ChatGPT. To better understand this problem, the company Vectara has created an "AI Hallucination Leaderboard" that ranks various leading chatbots based on their rate of hallucination.

Vectara's Evaluation Approach

Methodology

Vectara's approach involves feeding over 800 short reference documents to various LLMs and requesting factual summaries. The responses are then

Measuring How Much Leading AI Chatbots Hallucinate Read more

Navigating LLM Issues: From Hallucinations to Innovation Featured Post

Discover the challenges and solutions in the AI realm, from addressing hallucinations to enhancing knowledge and logic.

Navigating LLM Issues: From Hallucinations to Innovation

Today let us explore the challenging problem of confabulation and hallucination in artificial intelligence (AI), delve into practical solutions prompt engineers can employ to mitigate these issues, and discuss the exciting possibilities of categories of apps and AI builders.

The Challenge: Confabulation and Hallucination in AI

In an ever-evolving technological landscape, artificial intelligence has come a long way. However, the risk of confabulation and hallucination is a significant hurdle in the field. Confabulation refers to when an AI generates information that isn't entirely accurate, while hallucination is when the AI delivers output unrelated to the input, often creating information from

Navigating LLM Issues: From Hallucinations to Innovation Read more