Can AI Give Legal Advice? We Tested It on 100 Real Questions

By HAQQ Research · 2026-06-11 · 11 min read · Guides

Can AI give legal advice? Legally no, and we are the only ones who measured the rest: 3 frontier models, 100 real legal questions, 78-88% pass rates.

Key facts

Legal advice is a regulated activity. ABA Model Rule 5.5 prohibits the unauthorized practice of law; the definition of "practice of law" is set by each jurisdiction and varies state to state.
OpenAI restricted it in writing. On October 29, 2025, OpenAI's usage policies were updated to prohibit tailored legal advice "without appropriate involvement by a licensed professional."
AI is measurably good at legal information. In our benchmark of 100 real legal questions, Claude Sonnet 4 passed 88%, GPT-4o 87%, and Gemini 2.5 Flash 78%.
Its weakest skill is knowing its limits. Appropriate Caveats was the lowest-scoring dimension for every model tested: 3.0-3.15 out of 5.
Citations are the danger zone. In our separate 3,000-answer commercial benchmark, 24% of answers cited or applied law that did not say what the model claimed.
The court record is real. We tracked 1,313 court proceedings involving AI-fabricated material and 496 sanctioned attorneys.
AI chats are not privileged. A federal court confirmed AI-generated documents are not protected by attorney-client privilege, and OpenAI's CEO has said chats can be produced in litigation.
Regulators enforce this. The FTC's final order against "robot lawyer" DoNotPay required $193,000 in monetary relief over unsubstantiated lawyer-performance claims.

Can AI give legal advice? The legal answer

Type a legal question into any AI chatbot and you will get an answer. A detailed one, in confident prose, in seconds. So "can AI give legal advice" sounds like a technology question. It is not. It is two questions wearing one sentence: is it allowed, and is it any good. Most articles on this topic answer one and wave at the other. We have data on both.

Start with allowed. In the United States, the practice of law is restricted to licensed lawyers in every state. ABA Model Rule 5.5 prohibits the unauthorized practice of law, and the official comment to the rule notes that the definition of the practice of law "is established by law and varies from one jurisdiction to another" (American Bar Association). Where exactly the line falls differs by state. But applying law to one specific person's specific situation, which is what people actually want when they ask an AI about their landlord or their employer, sits at the core of most definitions.

Regulators have already enforced this against AI products. In February 2025, the FTC finalized an order against DoNotPay, the self-described "robot lawyer," requiring $193,000 in monetary relief and prohibiting the company from claiming its service performs like a real lawyer without evidence to back it up (FTC, 2025).

The model providers know it too. On October 29, 2025, OpenAI updated its usage policies to prohibit using its services for "provision of tailored advice that requires a license, such as legal or medical advice, without appropriate involvement by a licensed professional" (OpenAI usage policies; Legal IT Insider, 2025). The viral version of that story, "ChatGPT banned legal questions," was wrong. You can still ask, and it still answers. What the policy disclaims is personalized legal advice: the regulated kind.

So the legality answer is short. AI tools do not give legal advice in the regulated sense, there is no licensed-software exception, and the biggest AI company in the world has put that in writing. What AI gives you is legal information. The honest question, the one almost nobody bothers to test, is how good that information actually is.

How good is AI at legal questions? We measured it

Most of what ranks for this topic is written by people who never ran a test. We ran one. For our benchmark of 100 real legal questions, we took the top 100 posts of all time from r/legaladvice, real questions from real people about landlord-tenant disputes, employment, custody, criminal defense and personal injury, and put each one through three frontier models with identical structured prompting.

Each answer was graded on five dimensions: legal accuracy, issue completeness, reasoning quality, practical value, and appropriate caveats. An answer passed if it averaged at least 3.5 out of 5 with no dimension below 2. The results: Claude Sonnet 4 passed 88 of 100. GPT-4o passed 87. Gemini 2.5 Flash passed 78.

Legal accuracy ranged from 3.98 to 4.30 out of 5 across the three models. And when we re-ran the pipeline on 20 fresh questions posted within the previous 48 hours, scores went up, not down: 95%, 90% and 85%. The models were not pattern-matching old Reddit threads. The reasoning capability is real.

Task	Reliability in our testing	Evidence
Explaining what a law or legal term means	Strong	Legal accuracy 3.98-4.30 / 5 across all 3 models
Spotting the legal issues in a situation	Strong	Issue completeness up to 4.82 / 5
Suggesting practical next steps	Strong	Practical value up to 4.73 / 5
Citing specific cases and statutes	Weak	24% of 3,000 answers miscited law (separate benchmark)
Flagging its own limits and disclaiming	Weakest	Appropriate caveats 3.0-3.15 / 5, worst dimension for every model
Being right every single time	No	Best pass rate: 88 of 100

Where AI fails on legal questions

The pass rates are the headline. The failure modes are the useful part.

Failure mode 1: missing caveats. The weakest dimension across all three models was not accuracy. It was appropriate caveats, at 3.0 to 3.15 out of 5. Models dove into detailed legal analysis, often correctly, without flagging that they were not giving legal advice or that a local attorney should check the jurisdiction-specific facts. Technically correct information delivered with inappropriate confidence is the signature failure of AI on legal questions.

Failure mode 2: inconsistency. Gemini 2.5 Flash posted the highest accuracy score (4.30) and the highest issue completeness (4.82), yet the lowest pass rate, because some answers were truncated and others skipped disclaimers entirely. A model that is brilliant 78% of the time and unreliable the rest is not a brilliant model. For legal questions, the floor matters more than the ceiling.

Failure mode 3: citations. This is the worst one. In our separate 300-task commercial benchmark, 24% of 3,000 graded answers cited or applied law that did not say what the model claimed. Invented cases, misapplied statutes, the right doctrine pointed at the wrong jurisdiction. Every single model tested, including the leaders, fabricated or misapplied at least one citation.

And this is not theoretical risk. Our global report on AI hallucinations in court tracked 1,313 court proceedings involving AI-fabricated material, 496 sanctioned attorneys, and $55,597 in fines. If trained lawyers get burned by unverified AI citations, a consumer pasting an AI-drafted demand letter has no realistic chance of catching one.

AI on legal questions has a strange shape: it is better at the law than most people expect, and worse at knowing when to stop than almost anyone realizes. Both halves of that sentence are measurable. We measured them.

Legal information vs legal advice: the line that matters

The distinction the whole debate turns on is older than AI. Legal information is general: what a statute says, how eviction normally works in your state, what "at-will employment" means. Legal advice is applied: whether you should sign, what you should file, what is likely to happen in your case. Information educates. Advice directs action, creates reliance, and carries accountability.

AI is genuinely good at the first category. Our data shows it. It is structurally unsuited to the second, and not only because of unauthorized-practice rules. Advice is accountable: a lawyer who gets it wrong faces malpractice liability and bar discipline. An AI that gets it wrong faces nothing. The FTC's DoNotPay order exists precisely because a company sold the second category while delivering the first, badly.

For lawyers, the duties around AI use are now codified, from ABA Formal Opinion 512 to the EU AI Act. We cover those in our guide to AI ethics in legal practice. For consumers, no equivalent framework protects you. The caveats have to come from the tool, and as our benchmark shows, caveats are exactly what raw models are worst at.

Your AI conversations are not privileged

There is one more difference people miss until it hurts. Talk to a lawyer and attorney-client privilege protects the conversation from being used against you. Talk to a chatbot and nothing does. Sam Altman said it plainly in July 2025: there is no legal confidentiality when people use ChatGPT for sensitive matters, and in a lawsuit OpenAI could be required to produce those conversations (TechCrunch, 2025).

Courts have started confirming it. As we covered in our analysis of the privilege ruling, a federal judge ruled that documents generated using AI tools are not protected by attorney-client privilege. If you are typing facts into a general-purpose chatbot that could end up in litigation, assume what you type is discoverable.

When you actually need a lawyer

Our benchmark made us more bullish on AI for legal information, not less. But it also sharpened the boundary. Stop using AI alone and call a licensed lawyer when:

You have been sued, charged, or served. Court deadlines and procedure are unforgiving, and only a lawyer can represent you.
A clock is running. Statutes of limitation, notice periods, and response windows are jurisdiction-specific traps. AI's weakest skill is flagging what it might have missed.
The stakes are irreversible. Custody, immigration status, criminal records, or amounts of money you cannot afford to lose.
The other side has a lawyer. You are no longer exchanging information; you are in an adversarial process.
Someone must sign, file, appear, or negotiate for you. Those are acts of representation, and software cannot perform them.
The answer hinges on a specific citation. With a 24% miscitation rate across frontier models in our testing, an unverified case reference is a liability, not an answer.

How to use AI safely for legal questions

Used correctly, AI is the best legal-literacy tool ever shipped to consumers. Five billion people lack access to legal help, and "never use AI for legal questions" is advice only people who can afford lawyers give. Here is how to use it without hurting yourself:

Use it to understand, not to decide. Let it explain the law, map your options, and define the terms. Make decisions with a licensed professional when the stakes warrant one.
Always state your jurisdiction. Legal answers change at borders. An answer without a jurisdiction is a guess.
Ask for the other side. Prompt it for the counter-arguments and for what it might be missing. You are manually adding the caveats our benchmark shows models leave out.
Verify every citation before relying on it. Look up the case or statute in a primary source. Frontier models miscited law in 24% of answers in our testing.
Keep sensitive facts out of general chatbots. Those conversations are not privileged and can be discoverable.
Use AI to prepare for the lawyer, not to replace one. Arriving with the issues mapped and the vocabulary learned makes the paid hour dramatically more productive.
Prefer legal-specific tools over general chatbots. The failure modes we found, missing caveats, inconsistency, and unverified citations, are exactly what a dedicated legal AI layer exists to fix: routing across models, built-in disclaimers and escalation to human lawyers, and references that are traceable instead of decorative. That layer is what we build at HAQQ.

FAQ

Is it legal to ask AI legal questions?

Yes. Asking questions and reading legal information is legal everywhere. Unauthorized-practice-of-law rules restrict who may provide legal advice and representation, not who may seek to understand the law. The legal risk sits with products that hold themselves out as lawyer substitutes, as the FTC's DoNotPay order showed.

Can ChatGPT give legal advice?

Not officially. Since October 29, 2025, OpenAI's usage policies prohibit using its services for tailored legal advice without appropriate involvement by a licensed professional. In practice ChatGPT still answers legal questions, but what it provides is legal information, with no license, no liability, and no privilege behind it.

How accurate is AI on legal questions?

In our benchmark of 100 real consumer legal questions, Claude Sonnet 4 passed 88%, GPT-4o 87%, and Gemini 2.5 Flash 78%, with legal accuracy between 3.98 and 4.30 out of 5. The biggest weakness was not accuracy but missing caveats, and in a separate 3,000-answer benchmark, 24% of answers miscited law.

What is the difference between legal information and legal advice?

Legal information explains the law in general: what a statute says or how a process works. Legal advice applies the law to your specific facts and tells you what to do. The second is a regulated activity reserved for licensed lawyers in every US state, and it is the part that carries professional accountability.

Are conversations with AI chatbots confidential or privileged?

No. There is no attorney-client privilege between you and a chatbot. OpenAI's CEO has acknowledged that ChatGPT conversations could be produced in litigation, and a federal court has ruled that AI-generated documents are not privileged. Treat anything you type into a general-purpose AI as potentially discoverable.

Can AI replace a lawyer?

No. AI cannot represent you in court, sign filings, negotiate on your behalf, or carry liability for being wrong. Our testing shows frontier models handle legal reasoning well 78-88% of the time, but legal work demands consistency, verified citations, and accountability that raw models do not provide.

When should I stop using AI and hire a lawyer?

When you are sued, charged, or served; when a deadline is running; when the outcome is irreversible; when the other side has counsel; or when you need someone to act for you. Use AI before that point to understand your situation, so the lawyer's time goes to judgment instead of explanation.

Key takeaways

AI cannot legally give legal advice. UPL doctrine reserves applied legal advice for licensed lawyers, and OpenAI's own policy now says the same.
AI is measurably good at legal information. 78-88% pass rates on 100 real questions, improving to 85-95% on fresh ones.
The failure modes are specific: missing caveats, inconsistency, and a 24% miscitation rate. None of them are fixed by a bigger model.
No privilege protects your chats. Sensitive facts belong with a lawyer, not in a general chatbot.
The right use is preparation, not replacement. Understand with AI, decide with a lawyer, and verify every citation in between.

Can AI Give Legal Advice? We Tested It on 100 Real Questions

Key facts

Can AI give legal advice? The legal answer

How good is AI at legal questions? We measured it

Where AI fails on legal questions

Legal information vs legal advice: the line that matters

Your AI conversations are not privileged

When you actually need a lawyer

How to use AI safely for legal questions

FAQ

Is it legal to ask AI legal questions?

Can ChatGPT give legal advice?

How accurate is AI on legal questions?

What is the difference between legal information and legal advice?

Are conversations with AI chatbots confidential or privileged?

Can AI replace a lawyer?

When should I stop using AI and hire a lawyer?

Key takeaways

Sources & further reading

FAQ

Is it legal to ask AI legal questions?

Can ChatGPT give legal advice?

How accurate is AI on legal questions?

What is the difference between legal information and legal advice?

Are conversations with AI chatbots confidential or privileged?

Can AI replace a lawyer?

When should I stop using AI and hire a lawyer?