therapies 14 min read

What exactly happens in speech therapy — a session-by-session walkthrough

Most parents book their first speech therapy session without a clear picture of what their child will actually be doing. Here's what a session looks like minute by minute, how targets are chosen, and how we measure whether it's working.

Written by
Neuronurture clinical team
Senior speech-language pathologists, ABA analysts, occupational therapists, and child psychologists, supervised by our team of developmental paediatricians
Reviewed by
Chief Medical Officer
MBBS · DNB (Paediatrics) · Fellowship in Developmental & Behavioral Paediatrics · Karnataka Medical Council registered
Published 11 May 2026
Take the first step

Book your free consultation.

A 30-minute consultation with a developmental paediatrician or senior therapist. Free, no obligation.

Free Callback in 2 hours No obligation
Get appointment
We call within 2 hours · 100% free

The most common moment of nervousness in a first speech therapy session isn’t the child’s. It’s the parent’s, sitting just outside the camera frame, watching their 3-year-old click on a sound game with a stranger on screen and wondering what is actually supposed to happen here. Most parents have never seen a therapy session before, and they hold a mental picture stitched together from films and second-hand stories — flashcards, stern correction, a child reluctantly reciting words. The reality is almost never that. But “almost never that” doesn’t really tell you what speech therapy is, which is what most parents would actually like to know before they book.

So: what actually happens in a speech therapy session, from the first contact to the hundredth, and how is it different from a parent at home cheerfully reading books with their kid? The short version is that the sessions are built around specific, measured targets; the techniques are shaped by decades of evidence about what actually moves the needle for paediatric communication; and the data that comes out of every session is used to adjust the next one. The longer version follows.

Before the first session — the intake and assessment

Most families’ first contact is a 30-minute consultation, free in our case, where you describe what you’re noticing about your child and we describe what assessment and therapy at our clinic would look like. We ask about milestones, medical history, hearing screening, language environment at home, and what’s prompted you to seek help now. By the end of that call, you have a clear sense of whether full assessment is the right next step or whether watchful waiting with structured parent coaching might serve better.

If assessment is the right next step, we book a 60–90 minute session — sometimes split across two — for a structured speech-language assessment. A paediatric speech-language pathologist (SLP) maps four domains:

  • Receptive language — what your child understands. We assess with age-normed instruments and structured play observation, looking at vocabulary comprehension, following directions, and grammatical understanding.
  • Expressive language — what your child produces. Vocabulary count, sentence length, sentence structure, narrative coherence at older ages.
  • Articulation and phonology — how clearly individual sounds and sound combinations are produced. Some children have a few specific sound errors (the /r/, /k/, /g/, /s/); some have broader phonological process patterns that affect many sounds together.
  • Pragmatics — the social use of language. Turn-taking, eye contact, joint attention, conversational repair, perspective-taking. This domain is particularly relevant for autistic children and for children with social-communication differences.

The assessment also screens for two things that often look like primary language problems but aren’t: hearing loss (most paediatric speech delays in India that don’t respond to early intervention have unresolved otitis-media-with-effusion history we recommend ENT review for) and oral-motor structural concerns (tongue-tie, palatal differences) that an SLP refers out for clinical review.

You receive a written assessment report within 5–7 working days. It names which domain(s) are affected, severity, and the recommended intervention pattern — how many sessions per week, expected duration of an intervention cycle, and the specific targets we’d start with.

What an actual session looks like — preschoolers

For a 3-year-old who’s working on expressive language and articulation, a 30-minute online session typically runs like this:

0:00 — 0:03. Settling and greeting. The therapist greets the child by name in a warm, child-friendly voice, often with a familiar puppet or character. The parent is usually in-frame or just off-camera, and the therapist briefly checks in: “How was school? Anything I should know about today?” This isn’t filler. It’s affective regulation — making sure the child arrives calm enough to learn.

0:03 — 0:08. Warm-up activity. A familiar game with low cognitive demand — a song the child knows, a counting routine, blowing virtual bubbles. The point is to load up easy successes before introducing new material. This is the moment to deploy reinforcers (verbal praise, an animated sticker, a small earned reward) for engagement so the rhythm of “respond → get something good” is established.

0:08 — 0:22. Main activity, target-focused. The bulk of the session is built around two or three specific targets from the current cycle. For an expressive-language target, this might be eliciting two-word combinations through a structured play activity — a farm scene where the child needs to say “horse jump” or “cow eat” to make the action happen. For an articulation target, this might be elicitation drills for the /k/ sound at the word-initial position (“cat, key, cookie, car”) embedded inside a guessing game so the practice doesn’t feel like drill. The therapist is collecting data the whole time: percentage of correct productions, types of errors, level of prompting needed.

0:22 — 0:27. Generalisation activity. Take the target into a less structured moment. The child describes a picture, narrates a short video, or has a brief conversation with the therapist. The goal here is to see whether the skill survives outside the controlled drill — that’s where real-world use eventually lives.

0:27 — 0:30. Wind-down, parent debrief, home-practice setup. The session closes with a fun activity to end on positive affect. Then the therapist spends two or three minutes with the parent: what we worked on, what to practise this week, what to look for. If you’re not getting this debrief at the end of every session, ask for it — it’s the single highest-leverage three minutes of the week for home generalisation.

What it looks like for older children

For a 7-year-old working on phonological awareness, fluency, or pragmatic language, the proportions shift. There’s still a settling moment and still a wind-down, but the middle section becomes more cognitively explicit. The child knows what we’re working on; they can name the target (“I’m working on my /r/ sound”) and self-monitor. Sessions often include short focused practice followed by application to reading or conversation, with the child taking more active responsibility for noticing their own errors. The therapist’s role shifts from heavy scaffolding toward coaching and meta-cognitive support.

The shape of a session looks like play in early childhood and looks like practice with reflection by middle childhood. The data underneath is the same.

For school-age children working on fluency (stuttering), sessions often include both direct technique work (easy onsets, breath control, light articulatory contact) and indirect work (attitudes, avoidance reduction, self-advocacy). The combination matters more than either alone. The Cochrane evidence is clear that fluency therapy works best when emotional and behavioural components are addressed alongside the speech mechanics.

What the parent does during sessions

For under-5s, parent presence is usually the default. Your role during a session is neither to lead nor to step back entirely — it’s to observe with intention, so that you can later replicate the technique the therapist used. The therapist will narrate (“watch how I’m pausing for 5 seconds before I name the object — that wait time is what’s giving your daughter space to attempt”), which is partly for the child and partly for you. By session 4 or 5, many parents can confidently apply the same techniques in the next day’s lunch routine. That transfer is the most important thing that happens between sessions.

For older children, parent presence shifts to the start and end of sessions — debrief at the start, summary at the end, and the middle is the child’s space with the therapist. This protects the developing autonomy of the child and reduces the social complexity of three-people-on-one-call.

How we measure progress

Three layers, each with a different time horizon:

  1. Trial-level data, captured every session. Percentage of correct productions on the current target, level of prompting required, error types. This is the granular tracking that drives the next session’s adjustments. You see this on the weekly report.
  2. Cycle-level review, every 6–8 weeks. A formal review of which targets have resolved, which need more cycles, which need replacing. Re-assessment on age-normed measures if a domain has shifted enough to retest meaningfully. This is when the therapy plan formally adjusts.
  3. Long-term generalisation, evaluated every 4–6 months. Does the child use gains in real life — at school, with peers, in unscripted moments? This is the highest-stakes measure. If a child is showing 90% accuracy in the therapy room but 20% accuracy in the classroom, we have a generalisation problem and we re-design.

The Cochrane review on paediatric speech-language interventions is consistent on this point: short-term measures correlate poorly with long-term function unless you also track real-world use. Any therapy that measures only the first layer is undercounting what matters.

How long does it take?

It depends — predictably — on what you’re treating. A single sound error in an otherwise typically developing 5-year-old often resolves in 12–20 sessions. A broader phonological disorder usually takes 30–60 sessions across multiple cycles. Mixed receptive-expressive language disorders take longer still, often a year or more of consistent weekly work. Fluency therapy is rarely “finished” in the way articulation is — the goal is functional management, not zero stuttering, and the work tapers rather than ends.

We name an honest expected duration at the assessment report. If we’re not at least directionally accurate by 4–6 months, we re-assess and explain the deviation. That candour is part of the contract.

What changes between sessions matters as much as the sessions

A child who attends weekly therapy for six months and does no targeted practice in between will progress, but slowly. A child whose parent applies 10 minutes a day of the techniques modelled in the session will progress two to three times faster on most measures. This isn’t speculation — it’s the central finding of decades of parent-coaching intervention research, summarised in the ASHA practice portal and reinforced by every large-scale paediatric outcomes study since.

So: the sessions are necessary, and the between-session work is half of what makes them effective. That’s why every session ends with a clear, specific home-practice ask — and why the parent who shows up to session 12 with the home practice consistently done is the parent whose child is, statistically, doing best.

What it costs

Per-session pricing is shared transparently after the free assessment. The free 30-minute first consultation is exactly that — free, no commitment, no upsell. If we don’t think therapy is what your child needs, we’ll tell you and direct you to the right next step. If we think therapy is the right call, the assessment report names the specific cycle structure and total expected cost ranges, so you’re never guessing.

If you’d like to book the free consult, our contact form routes to a clinician on the same day. Or explore our speech therapy programme for the full intake-to-discharge view.

Backed by
ASHA — American Speech-Language-Hearing Association Cochrane Database of Systematic Reviews Rehabilitation Council of India (RCI) World Health Organisation (WHO) Indian Academy of Pediatrics (IAP)
View sources
  1. 01
    ASHA — American Speech-Language-Hearing Association · Practice Portal: Treatment of Speech and Language Disorders in Children
  2. 02
  3. 03
    Rehabilitation Council of India (RCI) · Standards for Speech-Language Pathology Practice in India
  4. 04
  5. 05
    Indian Academy of Pediatrics (IAP) · Guidelines on Developmental Surveillance and Stimulation

Reviewed by Chief Medical Officer (MBBS · DNB (Paediatrics) · Fellowship in Developmental & Behavioral Paediatrics · Karnataka Medical Council registered). Educational content; not clinical advice.

Common questions

Questions parents also asked.

How long is a typical speech therapy session for a young child?

30 minutes is the standard session length at our practice across age bands — long enough to do meaningful targeted work, short enough to keep paediatric attention. Toddlers between 18 and 30 months sometimes do better with even shorter, more frequent sessions because attentional bandwidth is the limiting factor at that age. The Cochrane review on speech-language interventions found that what matters more than session length is frequency (two to three weekly sessions outperform weekly) and the consistency of between-session practice.

Will my child actually 'do' speech work, or is it all play?

Both — and on purpose. For preschool-age children, play *is* the work. A therapist building joint attention, expanding vocabulary, or shaping articulation around a pretend-tea-party is doing structured therapy that looks like play from the outside. The targets, the data, and the progressive shaping are all happening; the child experiences the time as enjoyable. As children get older (around 6+), sessions involve more explicit instruction and practice drills, because the cognitive scaffolding is in place to handle that. Either way, every session is built around defined targets, not 'just talking together.'

Why does the therapist want me in the session at first, then ask me to step out later?

Two reasons. Early on, parent presence stabilises the child emotionally and lets the therapist coach you on techniques in real time — which is much of how the home gains happen. As the therapeutic alliance forms (usually within 4–8 sessions), the child often does better with the parent observing from the side or another room, because the child stops splitting attention. The right pattern varies child-to-child; a good therapist will explain when and why she's making the shift.

How will I know if therapy is actually working?

Two layers. Short-term, you'll see specific targets resolve — the /k/ sound moves from 0% accuracy to 80%, vocabulary jumps, requests start with words instead of pulling your hand. Your therapist's weekly report should name the targets and the percentages. Long-term, you'll see generalisation: your child uses the gains in places no one taught them — at the grandparents', with peers, in unscripted moments. Both matter. If after 8–10 sessions you're not seeing change on the named targets, ask. Sometimes the target needs adjusting; sometimes the technique does; rarely the diagnosis underneath needs revisiting. A good therapist will surface this before you have to.

Does online speech therapy work as well as in-person?

For most paediatric speech-language goals, yes — the published comparative-effectiveness studies and meta-analyses since the wider adoption of tele-therapy during 2020–2022 have generally found no significant outcome difference for speech-sound disorders, language disorders, fluency, and most pragmatic-language work. Two caveats: feeding therapy and severe phonological disorders involving precise oral-motor visualisation still favour in-person; and online therapy works best when the parent is present and engaged (which, for our families, is the default model). For everything else, the convenience advantage of online — same therapist every week, no commute, no waiting-room fatigue — usually outweighs any small effect-size gap.

About the author

Neuronurture clinical team

Senior speech-language pathologists, ABA analysts, occupational therapists, and child psychologists, supervised by our team of developmental paediatricians

Articles authored by working clinicians at Neuronurture Kids — speech-language pathologists, occupational therapists, behaviour therapists, and special educators — collectively responsible for the practice's published guidance to parents.

Free consultation

Talk to a paediatrician — free, in 30 min.

Honest read, no script, no pressure. We listen, we observe, we tell you what we think.

Get appointment
We call within 2 hours · 100% free
Free 2-hour callback No obligation