The IELTS Speaking test is short — eleven to fourteen minutes — but it decides a number that can gate a visa, a university place or a job. The good news is that it’s the most predictable part of IELTS. The format never changes, the question types repeat, and two of the four things you’re scored on are skills you can train directly at home. This guide explains exactly how the test works, how the band is calculated, and how to practise each part so your score reflects how well you actually speak.
What the IELTS Speaking test is
IELTS Speaking is a face-to-face conversation with a certified examiner, not a computer. It runs the same way for both the Academic and General Training versions, and it’s recorded. The whole thing takes eleven to fourteen minutes and is split into three parts that get progressively more demanding — a familiar warm-up, a solo talk, and an abstract discussion.
Because it’s a real conversation, you can’t hide behind a memorised script. Examiners are trained to notice rehearsed answers, and they steer the talk to make you respond spontaneously. That sounds intimidating, but it’s actually freeing: you don’t have to predict exact questions, only prepare the topics and the skill of talking about them naturally.
How IELTS Speaking is scored
Your speaking band runs from 0 to 9 and is the average of four criteria, each weighted equally. According to the public IELTS band descriptors, those four are:
- Fluency and coherence — can you speak at length, smoothly, and connect ideas logically without long pauses or constant self-correction?
- Lexical resource — the range and precision of your vocabulary, including less common words and natural collocations.
- Grammatical range and accuracy — the variety of structures you use and how often they’re correct.
- Pronunciation — how easy you are to understand, including word stress, sentence stress, rhythm and intonation.
The detail that changes how you should prepare is the equal weighting. Pronunciation and fluency together make up half your speaking band — and they’re the two criteria that grammar drills and vocabulary lists do nothing for. They’re trained by speaking out loud and copying natural models, which is precisely what shadowing does. A candidate with solid grammar who sounds flat, hesitant and hard to follow will lose marks that a slightly less accurate but fluent, clear speaker keeps.
One more point that calms a lot of nerves: a native accent is not required and not rewarded. The pronunciation criterion measures clarity and natural features like stress and intonation — not whether you sound American or British. You can keep your accent and still score band 8.
The three parts, in order
Part 1 — the interview (4–5 minutes)
The examiner introduces themselves, checks your identity, then asks familiar questions about you: your home, your work or studies, your hobbies, daily routine, food, travel. Answers should be two or three sentences — a direct reply plus a reason or example. One-word answers waste the chance to show fluency; speeches break the rhythm of the interview. The full Part 1 breakdown covers the common topics and how to extend answers without sounding scripted.
Part 2 — the long turn (3–4 minutes)
You’re handed a cue card with a topic and a few prompts, given one minute to prepare and make notes, then asked to speak alone for one to two minutes. This is the part most candidates fear, because there’s nowhere to hide and no one to bounce off. The skill is organising ideas fast and talking at length. Our Part 2 cue-card guide shows a note-taking method and how to fill the full two minutes.
Part 3 — the discussion (4–5 minutes)
The examiner takes the Part 2 topic up a level into abstract, general questions — about society, change, opinions and the future. Here you have to explain, compare, speculate and justify. Part 3 is where higher bands are won and lost, because it rewards developed answers over quick opinions. See the Part 3 discussion guide for how to build answers that go beyond a single sentence.
Why so many people stall at band 6 to 6.5
A huge number of test-takers plateau just below the band they need, and it’s rarely because their grammar is weak. The usual causes sit squarely in the fluency and pronunciation criteria: long hesitations while searching for words, a flat delivery that makes them hard to follow, word stress in the wrong place, and answers that stop after one sentence. None of those are knowledge gaps — they’re production habits, and production habits are exactly what out-loud practice rewires.
This is why passive study often fails to move a speaking band. You can read every grammar rule and still freeze in the room, because the test measures what your mouth can do under pressure, not what your eyes can recognise on a page.
How to practise — a method that targets the band criteria
The most efficient prep attacks fluency and pronunciation directly, because that’s half your score and the half you can improve fastest.
1. Shadow model answers and real interviews. Find clear recordings of strong speakers answering the kinds of questions IELTS asks, and shadow them — repeat along about a second behind, copying the rhythm, stress and intonation. This builds the natural delivery the pronunciation criterion rewards. If shadowing is new to you, start with the complete shadowing guide.
2. Rehearse realistic questions aloud — then vary them. Practise the common topic types out loud, but don’t memorise word-for-word answers; train the idea so you can say it differently each time. This is what protects you when the examiner asks something you didn’t expect.
3. Record yourself and check pronunciation objectively. Your own ear is unreliable on your own speech. Record an answer, then run a sentence through the Pronunciation Checker to see which words and sounds let you down. Repeat the ones below the line until they’re clean.
4. Time yourself. Two minutes is longer than it feels in Part 2, and Part 1 answers that run too long cost you. Practising against the clock makes the real timing feel familiar instead of alarming.
How long does it take to raise your band?
There’s no honest fixed number — it depends on your starting point, how often you practise out loud, and how specifically you target your weak criterion. What’s consistent is the pattern: short, daily, spoken practice moves a speaking band faster than occasional long study sessions, because fluency and pronunciation are motor skills that fade without regular use. Diagnose which of the four criteria is holding you back, then spend most of your practice there rather than spreading effort evenly.
Tools that make IELTS speaking practice work
You can prepare with nothing but recordings and your voice, but a few free tools make each step measurable. The Pronunciation Checker turns a vague sense of “was that okay?” into a real score on the exact criterion examiners assess. The Text → IPA converter shows you the target sounds and stress for any phrase, and the text-to-speech reader gives you a clean model to shadow. When you’re ready to put it together, shadow real clips sentence by sentence in the Practice Player and get instant feedback on the fluency, pronunciation and intonation your IELTS band depends on — free, in your browser.