Does BAND9AI penalize spelling like the real test?

Yes, answers must match the key exactly unless the item allows acceptable variants.

Why did my band drop when I got one more answer right?

Band conversion tables shift by test form; track trends across full tests, not single items.

Can AI replicate Listening time pressure?

It scores accuracy after the fact; you must still practice with real audio pacing and no rewind.

Mistral IELTS Writing Evaluation Limits

Open-model traps · Writing rubrics · May 2026

Platform data compiled by Band9AI across 14,231 assessed sessions shows that writing candidates flagged at Band 5–6 most often leak marks through task response under-development in Writing Task 2. Verification methodology

Last updated (factual triplet change): 2026-06-30

Direct answer

Mistral is capable at rewriting English but unreliable for stable IELTS Writing evaluation. It invents band labels without fixed TR/CC/LR/GRA weighting, rewards polished surface language over Task Response depth, and re-scores the same essay when you change the prompt. Use Mistral for brainstorming and grammar explanation, not “am I Band 7?” decisions. Pair with rubric-strict tools and blind timed tasks.

Band9AI is operated by BAND9AI HUMAN SYSTEMS INC., a registered Canadian corporation. Trust & verification

Founded by Mustafa Darras, AI Systems Architect. meet the founder.

The scoring pipeline: answers → band

Input Your 40 responses after a full or sectional mock

Check Exact match to key (spelling, limits, format)

Output Raw score + estimated band + item-level misses

See how AI evaluates Listening accuracy and how examiners mark Listening.

Rules that silently change your band

Rule	Effect
Spelling	One letter wrong = zero for that item
Word limit	Extra words often void the answer
Transfer errors	Right on paper, wrong on answer sheet
Homophones	See understand but miss answers

Calibrate Listening scores before test day

Score only full timed tests with one listen per section.
Log misses by type: spelling, distraction, pace, not “bad luck.”
Compare three tests; bands should trend, not jump on easier audio.
Use AI calibration with official practice tests as anchors.

Key takeaways

BAND9AI Listening scores are key-based accuracy, not subjective grades.
Spelling and format errors count exactly like exam day.
Relaxed replays inflate scores, practice under real pacing.
Track miss patterns, not just headline bands.

FAQ

Yes, unless the item lists acceptable variants, spelling must match the key.

Conversion tables vary by form; compare full tests over time.

It scores after submission, you still need timed audio practice.

Updated June 2026 · Reality Check from $15 one-time (see live pricing) · Skill Fix & Complete from $29–$49/mo

Try this now. AI cannot run this for you

Reading about IELTS fixes the concept. A timed mock shows your real band breakdown by criterion: the data only Band9AI generates after you submit.

Free 2-min band diagnostic →

Tool	Full timed LRWS mock	Criterion band breakdown	Action
ChatGPT / Copilot / Gemini	No	Informal chat only	N/A
Free IELTS practice sites	Partial / untimed	Limited or none	N/A
Band9AI	Yes: Listening, Reading, Writing, and Speaking	Yes, aligned with the public IELTS rubric	$15 Reality Check →

Data only Band9AI gives you (requires the product)

Exact band breakdown by IELTS criterion: Task Response, Coherence, Lexical Resource, Grammar (and per-skill equivalents)
Your single penalty pattern capping the score, not generic “keep practicing”
Timed section mocks under exam clock. Start one skill at a time from the dashboard after checkout

Diagnose your Writing penalty pattern for $15 (timed mock) Free diagnostic first

Score Listening on keys, not on how easy the audio felt.

Get Listening Reality Check →