Here’s what nobody tells you upfront: if you’re already using Google Workspace, Gemini’s $20/month Pro plan is probably worth it. If you’re not, the free tier will handle 80% of what you throw at it, and upgrading makes no sense until you actually hit a wall.
Most tutorials start with “What is Gemini?” – you already know it’s Google’s AI chatbot. Let’s skip to what actually matters.
The Free vs Paid Decision (Make This First)
Before you touch anything else, figure out which tier you need. Google AI Pro costs $19.99/month as of April 2026. Here’s the honest breakdown.
Stick with free if:
- You’re asking fewer than 50 questions per day with the smarter model
- You don’t live in Google Docs/Gmail for work
- You’re fine with Gemini 2.5 Flash (which is fast and handles most tasks well)
- 100 AI-generated images per day is enough
Upgrade to Pro ($19.99/mo) if:
- You use Gmail, Docs, or Sheets daily and want AI directly in those apps
- You need Gemini 2.5 Pro’s deeper reasoning for coding, analysis, or research
- You’re hitting the free tier’s model limits (you’ll know – Gemini will tell you)
- You want 1,000 AI credits/month for serious image or video generation
The context window – Gemini’s famous “1 million tokens” feature that lets it read entire books – is available on both free and paid. The difference is which model processes those tokens. Pro gives you consistent access to the smarter 2.5 Pro model; free gives you mostly Flash with occasional Pro access.
Access Gemini: Three Ways, Three Different Experiences
Google gives you three paths. Each has different limits.
1. Web/Mobile App (gemini.google.com)
Sign in with your Google account. That’s it. This is where most people start. Free users get 100 images/day with Nano Banana (the image generator), limited Pro model access, and a context window that ranges from 32K to 128K tokens depending on server load. According to Google’s own support docs, these limits “may change frequently.”
2. Google AI Studio (ai.google.dev/aistudio)
Developer playground. Same Google account login. You get 500-1,000 images/day here (dynamic based on load), 15 requests per minute, and full access to experiment with prompts. This is where you prototype before building.
3. Gemini API (Developer/Enterprise)
Requires API key from Google AI Studio. Free tier: 500 requests/day with Flash. Paid tier starts at usage-based pricing. If you’re building an app, this is your route. If you’re just using Gemini for work, ignore this.
Pro tip: The three access methods have independent quota pools. Power users stack all three – 100 images from the app, 500 from AI Studio, 500 from the API – for 1,100+ free images/day. This isn’t a hack; it’s how Google structured it.
The Workspace Permission Trap (Fix This Now)
If you want Gemini inside Gmail, Docs, or Sheets, there’s a setup step every tutorial glosses over. You need two permission toggles.
- Open Gmail → click the gear icon (Settings) → select “See all settings”
- Stay on the “General” tab, press Ctrl+F (or Cmd+F on Mac), search for “smart features”
- Enable both “Smart features” and “Google Workspace smart features”
- Scroll down, click “Save Changes”
- Now open Gemini → click your profile icon → Settings → Apps tab
- Enable the “Google Workspace” extension
Miss either step and Gemini won’t see your emails or docs. It fails silently – no error, just a useless “I don’t have access to that” response.
This two-step requirement exists because Google separates general AI features from workspace-specific ones. It’s a privacy boundary, but the UI doesn’t make it obvious.
What Actually Works Once Enabled
In Gmail: Draft replies, summarize threads, search your inbox with natural language (“find the email from Sarah about the Q2 budget”).
In Docs: Generate outlines, match writing style, rewrite sections. The side panel (✦ icon) is where Gemini lives.
In Sheets: Use the =AI formula to categorize data, translate text, or generate values based on other cells. Type =AI, describe what you want, drag down to apply to multiple rows.
Models: Fast, Thinking, or Pro?
Gemini gives you a model picker. Here’s what each actually does.
| Model | Speed | Use Case | Free Access |
|---|---|---|---|
| Fast (2.5 Flash) | Instant | Everyday tasks, summaries, drafts | Unlimited |
| Thinking (3 Flash) | ~5 sec delay | Complex reasoning, step-by-step logic | Limited daily |
| Pro (2.5 Pro or 3.1 Pro) | Slower | Deep analysis, coding, research | Very limited |
Free users default to Fast. You can switch to Thinking or Pro manually, but you’ll hit daily caps. Pro subscribers get 100 Pro prompts/day and 300 Thinking prompts/day as of early 2026 (these numbers changed – Google doubled them in January 2026 after user backlash).
When you hit your limit, Gemini auto-switches you back to Fast mid-conversation. You’ll see a banner notification.
The 200K Token Pricing Cliff (API Users Only)
If you’re using the Gemini API for development, watch your input token count. The pricing has a hidden tier: prompts under 200K tokens cost $1.25 per 1M input tokens (2.5 Pro). Cross 200K and it jumps to $2.50 per 1M – double the price.
Output pricing rises too, from $10 to $15 per 1M tokens – a 50% bump.
This isn’t a bug. It’s tiered pricing for “long context” workloads. But it’s buried in the pricing tables, and developers building RAG systems or document processors often miss it until the bill arrives.
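To see the cliff concretely, here's a small cost estimator using the 2.5 Pro rates quoted above. It assumes (as Google's tiered pricing works) that crossing 200K input tokens re-prices the *entire* prompt, not just the overflow; confirm the current rates on the pricing page before budgeting.

```python
# Cost estimator for the 200K-token pricing cliff, using the article's
# 2.5 Pro rates. Assumption: the higher rate applies to the whole prompt
# once input exceeds 200K tokens.

def gemini_25_pro_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD under the tiered pricing described above."""
    long_context = input_tokens > 200_000
    input_rate = 2.50 if long_context else 1.25     # $ per 1M input tokens
    output_rate = 15.00 if long_context else 10.00  # $ per 1M output tokens
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000

# Just under vs. just over the cliff: nearly identical jobs, ~2x the price.
print(gemini_25_pro_cost(199_000, 5_000))  # 0.29875
print(gemini_25_pro_cost(201_000, 5_000))  # 0.5775
```

Two requests that differ by 2,000 input tokens, but the second costs almost double – that's the trap RAG builders hit.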
Context caching is your workaround. If you’re repeatedly using the same large document (say, a 500-page manual), cache it. Cached input costs drop from $1.25 to $0.13 per 1M tokens – a 90% discount. You pay a small hourly storage fee, but if users chat with that doc more than a few times, you save money.
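A quick break-even sketch, using the cached and uncached rates above. The hourly storage fee in the example call is a made-up placeholder – look up the real per-token-per-hour rate before relying on the math.

```python
# Break-even sketch for context caching, using the article's rates
# ($1.25/1M uncached vs $0.13/1M cached input). The storage fee passed
# in below is hypothetical, not Google's actual rate.

def caching_savings(doc_tokens: int, queries: int,
                    storage_per_hour: float, hours: float) -> float:
    """Dollars saved by caching a document reused across `queries` requests."""
    uncached = queries * doc_tokens * 1.25 / 1_000_000   # full rate every time
    cached = (queries * doc_tokens * 0.13 / 1_000_000    # discounted reads
              + storage_per_hour * hours)                # plus storage rent
    return uncached - cached

# A ~300K-token manual queried 20 times in one hour, with a placeholder
# $0.50/hour storage fee:
print(round(caching_savings(300_000, 20, 0.50, 1.0), 2))  # 6.22
```

The pattern to notice: a single query loses money to the storage fee, but savings compound with every repeat query against the same cached document.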
Multimodal Magic: Images, Video, and Why It Matters
Gemini was built multimodal from day one. Unlike GPT-4 or Claude (which added vision later via adapters), Gemini learned to see, read, and hear simultaneously during training. Practically, this means better image understanding and native video support.
You can upload:
- Up to 3,600 images per request (yes, really – though you’ll never need this many)
- Videos up to 90 minutes long
- PDFs, which Gemini treats as images (it OCRs them)
Common use: Drop a screenshot of a data table. Ask Gemini to extract it as JSON. Or upload a diagram and ask it to explain the workflow. Or paste a chart and ask “what’s the trend?”
Images count as tokens. A 1024×1024 image = ~1,290 tokens. Larger images get tiled into 768×768 chunks, each costing 258 tokens. If you’re hitting context limits, resize images before uploading.
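You can estimate image token cost before uploading. The sketch below assumes one flat charge for small images plus one downscaled overview and one charge per 768×768 tile for larger ones – an assumption that reproduces the ~1,290 figure for 1024×1024 quoted above, but not necessarily Google's exact algorithm.

```python
# Rough image token estimator based on the tiling scheme described above.
# Assumption: small images cost one flat 258-token unit; larger images cost
# one overview unit plus 258 tokens per 768x768 tile. This matches the
# article's ~1,290 figure for 1024x1024 but is a sketch, not Google's spec.
import math

def estimate_image_tokens(width: int, height: int,
                          tile: int = 768, per_tile: int = 258) -> int:
    """Estimate token cost for an image of the given pixel dimensions."""
    if width <= tile and height <= tile:
        return per_tile                      # fits in one pass
    tiles = math.ceil(width / tile) * math.ceil(height / tile)
    return per_tile + tiles * per_tile       # overview + tiles

print(estimate_image_tokens(1024, 1024))  # 1290
print(estimate_image_tokens(640, 480))    # 258
```

This is why resizing helps: shrinking a 1024×1024 screenshot to fit within 768×768 cuts its cost roughly fivefold.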
Deep Research: When Google Reads 100 Websites For You
Deep Research is Gemini’s killer feature if you’re a researcher, student, or analyst. You give it a topic. It spends 3-5 minutes browsing hundreds of sites, then delivers a multi-page report with citations.
Free users get it, but with daily limits (exact number varies – Google doesn’t publish it). Pro users get expanded access. Ultra users get the highest limits.
How it works: Gemini breaks your query into sub-questions, searches for each, reads the results, synthesizes findings, and writes a structured report. It’s not just summarizing top Google results – it’s following citation chains and cross-referencing sources.
When to use it: Market research, literature reviews, competitive analysis, policy summaries. When not to use it: quick facts (just ask Gemini normally), or topics where you need peer-reviewed sources only (Deep Research pulls from general web, not academic databases).
Three Gotchas No One Mentions
1. Output token cap breaks long responses
Gemini Pro can read 1 million tokens, but it can only write 64,000 tokens per response. If you ask it to summarize a 900-page document into a 100-page report, it’ll cut off mid-sentence at 64K tokens. Flash models cap even lower (8K-32K). Solution: ask for shorter summaries, or request the output in chunks (“give me pages 1-10, then I’ll ask for 11-20”).
2. Sensitivity filters trigger on innocent queries
Community users report Gemini refusing to answer historical or political questions that ChatGPT and Claude handle fine. Example from Reddit: asking about Jimmy Carter’s presidency failures required multiple rephrasings because Gemini flagged it as “too sensitive.” If you hit a block, rephrase neutrally or switch to another AI.
3. Model limits reset unpredictably
Google says limits are “distributed throughout the day,” which means they don’t reset at midnight. You might regain access to Pro model at 3 PM, or 9 AM the next day. There’s no countdown timer. If you’re on a deadline and hit your limit, you’re stuck with Fast until the invisible timer resets.
Gems: Custom AI Assistants You Build Once
Gems are Gemini’s version of ChatGPT’s Custom GPTs. You create a Gem by giving it instructions, uploading files, and setting a persona. Then you chat with it like a specialized assistant.
Example: Create a “Code Reviewer” Gem. Instructions: “You are a senior Python developer. Review code for bugs, style issues, and performance. Be direct.” Upload your team’s style guide. Now whenever you paste code, the Gem reviews it according to your rules.
Free users can create and use Gems. Pro/Ultra users get higher limits and priority access. Gems save time if you have repetitive tasks with the same context.
Where Gems shine: customer support templates, writing style guides, domain-specific experts (“legal contract analyzer,” “SQL query optimizer”).
Image Generation: Nano Banana vs Nano Banana Pro
Gemini generates images via two models: Nano Banana (fast, free tier gets 100/day) and Nano Banana Pro (higher quality, free tier gets 3/day).
Key difference in prompting: use “create” or “generate,” not “show.” If you say “show me a mountain,” Gemini searches the web and returns an existing image. If you say “generate a mountain at sunset,” it creates one.
Pro users get 1,000 images/day with Nano Banana and 100/day with Pro. Quality is comparable to DALL-E 3 or Midjourney for most use cases, though Midjourney still edges ahead on artistic texture.
When Gemini Fails (and What to Use Instead)
Gemini isn’t the best at everything. Here’s where it falls short as of April 2026:
- Real-time data: Gemini’s answers about current events lag behind Perplexity or ChatGPT with browsing. For up-to-the-minute info, use those instead.
- Artistic image generation: Midjourney still produces more aesthetically refined images. Gemini is fast and functional; Midjourney is art.
- Mathematical proofs: Claude Opus 4.6 outperforms Gemini on advanced math reasoning. Gemini handles high school calc fine; for graduate-level proofs, switch tools.
- Uncensored output: Gemini’s sensitivity filters are the most aggressive among major AIs. For controversial topics, ChatGPT or Claude are more permissive.
The 1 million token context window is real and works, but you’ll rarely need it. Most tasks fit under 50K tokens. The bigger win is Gemini’s workspace integration if you’re a Google user, and its multimodal strength for image/video analysis.
Frequently Asked Questions
Can I use Gemini without a paid subscription?
Yes. The free tier gives you Gemini 2.5 Flash (fast model) with unlimited use, limited access to Pro models, 100 AI-generated images/day, Deep Research with daily caps, and all core features. You only need to pay if you want consistent access to smarter models or Workspace integration.
Why does Gemini sometimes refuse to answer questions ChatGPT handles fine?
Gemini’s content filters are more conservative, especially around historical events, political figures, or anything it flags as potentially sensitive. This is a known community complaint. If blocked, rephrase your question more neutrally (avoid emotionally charged words), or use a different AI for that specific query. Google tunes these filters over time, so sensitivity varies by topic and month.
What’s the actual difference between the 1M token context window on free vs paid?
Both tiers support 1 million tokens in theory, but free users get a dynamically throttled window (often 32K-128K in practice) and limited access to the Pro model that uses the full context effectively. Paid users get consistent 1M token context with Gemini 2.5 Pro or 3.1 Pro. The context window is the capacity; the model tier determines how well Gemini uses it. Think of it like hardware: free users get less RAM and a slower CPU; paid users get the full 1M of RAM and a faster CPU.
Your next step: open Gemini and ask it a real question you need answered today. Don’t test it with “tell me a joke” – use it for actual work. That’s the only way to know if you need to upgrade.