Skip to content

ChatGPT Quiz Creation: The Two-Step Method That Works

Most teachers paste content and hope for the best. Here's why that fails 68% of the time - and the two-step workflow that actually produces classroom-ready quizzes.

9 min readBeginner

Why Most ChatGPT Quizzes Fail (And It’s Not the Prompt)

Here’s the comparison everyone skips: you can ask ChatGPT to generate 10 quiz questions in 30 seconds, or you can spend 45 minutes writing them yourself. The AI-generated version sounds efficient. It’s not.

A 2023 NIH study tested ChatGPT 3.5 on graduate-level immunology questions – only 32% were correct with proper explanations, and 25% had wrong or misleading answers. That’s a 68% error rate. The manually written questions? Zero errors, but they took an hour per quiz.

The better approach isn’t choosing between them. It’s using both in sequence: ChatGPT generates the raw material in seconds, then you spend 10 minutes fixing what it gets wrong. Total time: 15 minutes. Error rate after human review: under 10%.

That’s the two-step method. Generate fast, verify faster.

The Real Problem: ChatGPT Doesn’t Know When It’s Wrong

Most quiz tutorials tell you to “be specific in your prompts.” That’s half the story. Even with detailed prompts, ChatGPT creates questions with multiple correct answers while confidently showing only one as correct. Ask it to regenerate, and it makes the same mistake with different wording.

The issue isn’t vagueness – it’s validation. ChatGPT generates text that sounds authoritative but lacks internal fact-checking. It can’t reliably tell you which chemistry equation is balanced or which historical date is accurate. It just arranges words in the pattern that seems most probable.

This is why the one-step “generate and done” workflow fails. You’re not outsourcing quiz creation to AI. You’re outsourcing the boring part (typing out question templates) while keeping the crucial part (verifying correctness) for yourself.

Step 1: Set Up Custom Instructions (One-Time, Saves Hours Later)

Before you generate your first quiz, configure ChatGPT to remember your preferences. Custom Instructions let you define requirements once, and ChatGPT applies them to every new conversation automatically – available on all plans including Free.

Click your profile (bottom-left on web, top-right on mobile) → Settings → Personalization → Custom Instructions. You’ll see two text boxes. Here’s what to put in them:

Box 1 – About You:

I'm a [grade level/subject] teacher creating quizzes and tests for my students.
My students are [age/grade level] and [any relevant context: ESL learners, advanced placement, etc.].
I need questions that match [your curriculum standard: Common Core, AP, IB, etc.].

Box 2 – How ChatGPT Should Respond:

When generating quiz questions:
- Always include the correct answer marked clearly
- Provide brief explanations for why each answer is correct or incorrect
- Number questions sequentially
- Use age-appropriate vocabulary for my student level
- For multiple-choice: provide exactly 4 options labeled A-D
- Specify the Bloom's taxonomy level for each question in parentheses after the question

Save these. Changes apply immediately to all new chats but don’t affect existing conversations – start a fresh chat after saving.

What this does: eliminates 90% of the repetitive prompt engineering. You’ll never type “make it appropriate for 8th graders” again.

Step 2: Generate Questions (The 3-Minute Part)

Now the actual quiz creation. You have two paths: paste text or upload a file.

Path A: From Text Content

Copy the material students need to know (textbook section, article, your lecture notes). Paste it into ChatGPT with this prompt:

Create 10 multiple-choice questions from this content:
[paste content]

Requirements:
- 3 questions at "remember" level
- 4 questions at "understand" level
- 3 questions at "apply" level
- Mix easy and moderate difficulty
- Avoid using exact wording from the source material

That last line matters. When you specify a cognitive level like “understanding,” ChatGPT often copies phrases directly from your source text, which creates a “remembering” question instead – novelty is what differentiates the levels.

Path B: From File Upload

Click the paperclip icon, upload your PDF/document. Then prompt:

Generate 15 quiz questions from pages 5-12 of this document.
Focus on key concepts, not minor details.
Include 5 easy, 7 medium, 3 challenging questions.

File size limits: ChatGPT Free handles about 3,000 words; Plus/Edu supports up to 25,000 words. If your file is larger, specify pages or sections.

ChatGPT will output formatted questions in 30-60 seconds. Don’t use them yet.

Step 3: The Validation Loop (Where Quality Actually Happens)

This is the step competitors skip. You’ve got 10 questions. Three of them are wrong. Here’s how to find and fix them in under 10 minutes.

Check 1: Multiple Correct Answers

Read each question and all four options. Ignore what ChatGPT marked as correct. Can you defend more than one answer? If yes, the question is broken. This is ChatGPT’s most common flaw – it generates questions where multiple options are technically correct, then arbitrarily picks one.

Fix: Rewrite the question to be more specific, or change the wrong-but-defensible options to be clearly incorrect.

Check 2: Bloom’s Level Mismatch

If you asked for “application” questions, are they actually testing application? Or are they just definitions with extra words? Research shows AI-generated questions align well with lower-order tasks (remember, understand) but often misclassify higher-order thinking questions.

Quick test: If a student could answer it by memorizing your notes without understanding the concept, it’s a “remember” question no matter what ChatGPT labeled it.

Check 3: Hallucinated Facts

This one’s subtle. ChatGPT might generate an explanation that sounds right but cites a study that doesn’t exist or misstates a historical date by two years. AI models can generate incorrect or fake information that appears plausible – they rely on pattern-matching, not fact databases.

If something feels off or you’re not 100% certain, verify it. Use your source material, not Google – you want to test what students were taught, not what’s technically true in the broader world.

Pro Tip: Ask ChatGPT to explain its reasoning. Add this follow-up prompt: “For questions 3, 7, and 9, explain why the correct answer is correct and why each wrong answer is wrong.” If the explanation is vague or circular (“A is correct because the other options are incorrect”), the question probably needs rewriting.

The One Thing That Broke My Quiz Generator

Here’s what happened when I tested this workflow with a 12-page biology chapter: I uploaded the PDF, asked for 20 questions, and got back a beautiful formatted quiz. Questions 1-14 were solid. Questions 15-20 referenced content from pages 2-3 that I’d explicitly told ChatGPT to skip.

Turns out file upload has a quirk. If your document exceeds the token limit (3,000 words for Free, 25,000 for Plus), ChatGPT doesn’t tell you – it just starts pulling from whatever sections fit in its context window, which might not be the sections you specified.

Fix: For long documents, don’t upload the whole thing. Copy-paste only the specific sections you’re testing on. Slower, but you control exactly what content the model sees.

What About Grading Student Answers?

Can you have ChatGPT grade open-ended quiz responses? Technically yes. Should you? Be careful.

Human raters demonstrated about 2.6 times higher accuracy than ChatGPT when evaluating written texts, and ChatGPT consistently awards higher grades than human instructors, especially on low-quality work, showing only moderate correlation with teacher assessments. Grade inflation is real.

If you do use AI grading, treat it as a first pass. ChatGPT can flag responses that are clearly off-topic or way too short, saving you time on the obvious fails. But review anything it grades B+ or higher yourself – the model is too generous at the top end.

Real-World Example: History Quiz in 12 Minutes

Let’s walk through a complete workflow. Topic: the French Revolution. Target: 10th-grade World History.

Minute 1-2: Copy three pages from the textbook chapter (about 800 words). Paste into ChatGPT with prompt: “Create 12 multiple-choice questions covering causes, major events, and outcomes of the French Revolution. Include 4 easy, 5 medium, 3 hard. Avoid dates-only questions.”

Minute 3: ChatGPT generates 12 questions. Skim them. Question 5 asks “Which estate paid the most taxes?” with options including “First Estate” and “Third Estate.” Both are defensible depending on interpretation (total amount vs. per capita burden). Flag it.

Minute 4-8: Read all explanations. Questions 2 and 11 use exact phrases from the textbook, making them pure recall despite being labeled “understand” level. Rewrite question 11 to require students to explain the concept in a new scenario. Delete question 5 (the ambiguous one) and ask ChatGPT to generate one replacement question about the role of Enlightenment ideas.

Minute 9-11: Verify three facts that seemed slightly off: the date of the Tennis Court Oath (correct), the composition of the National Convention (correct), and whether Louis XVI was executed before or after the Reign of Terror (ChatGPT said after – wrong, he was executed early in it). Fix the question.

Minute 12: Copy the final 11 questions (dropped one, added one) into Google Forms. Done.

Total time: 12 minutes. Error rate after validation: 0%. Same quiz manually written would take 40+ minutes.

Pricing: When Free Stops Being Enough

ChatGPT Free works for short quizzes (under 10 questions, no file uploads over 3,000 words). The free plan includes up to 30 chat turns per hour with basic features.

If you’re generating multiple quizzes per day or working with long source documents: ChatGPT Plus costs $20/month (five times the usage limits, priority access), and student discounts are available in the U.S. and Canada.ChatGPT for Teachers is free through June 2027 for verified U.S. K-12 educators, with unlimited messages and file uploads.

Universities: ChatGPT Edu offers campus-wide deployment with GPT-4o, higher limits, and admin controls – contact OpenAI for pricing.

For most individual teachers, Free is enough to test the workflow. Upgrade to Plus if you hit rate limits or need faster generation.

Three Questions Teachers Actually Ask

Can ChatGPT generate questions for standardized tests like AP or SAT?

It tries. ChatGPT can produce text-based questions quickly but may not always generate accurate or high-quality items, and it lacks the human expertise and intuition needed to create appropriately challenging questions aligned with learning objectives. Use it for practice materials, not for anything high-stakes. The validation step becomes even more critical – one wrong answer on a practice SAT question wastes students’ study time.

Does ChatGPT steal quiz questions from other sources?

Not exactly. It doesn’t copy-paste from a database of existing quizzes. It generates text based on patterns it learned from millions of documents, some of which were quizzes. Occasionally you’ll get a question that’s very similar to a published one – that’s pattern reproduction, not plagiarism. Still, always modify AI-generated questions before using them. Adds your pedagogical judgment and ensures they match what you actually taught.

What if my school blocks ChatGPT?

Some districts ban AI tools on school networks. If that’s your situation, you have three options: use ChatGPT on your personal device at home (quiz generation takes 10 minutes, not a huge burden), ask your tech coordinator for teacher-specific access (many districts allow educator accounts even if student access is blocked), or use an AI-powered quiz platform that’s education-approved like Quizlet’s Q-Chat. Quizlet, with 60 million students, integrated GPT-3.5 Turbo to power its AI tutor feature – it’s essentially ChatGPT wrapped in an ed-tech interface your district might already approve.