Table of Contents
ToggleDifferent methods used for teaching speech – Auditory Global, Multisensory syllable unit, Association phoneme unit method, Cued speech, Auditory Verbal Therapy (AVT)
Auditory Global Method
This is a “top-down” approach that emphasizes the natural acquisition of speech through continuous, meaningful auditory exposure.
- Core Philosophy: Speech is best learned by listening to whole words, phrases, and sentences in natural contexts, rather than practicing isolated sounds.
- Key Characteristics:
- Maximal Auditory Input: Relies heavily on optimal amplification (hearing aids/cochlear implants) and a rich language environment.
- Focus on Prosody: Emphasizes the rhythm, intonation, and melody (supra-segmentals) of speech first, before worrying about perfect consonant articulation.
- Natural Context: Speech is taught during meaningful, everyday activities rather than in isolated drilling sessions.
- Best For: Children identified early with good access to sound via amplification, who can learn language incidentally.
Multisensory Syllable Unit Method
Often associated with Calvert and Silverman, this method uses the syllable, rather than the isolated sound or the whole word, as the foundational building block.
- Core Philosophy: The syllable is the most natural physiological unit of speech. Teaching isolated phonemes (like /b/) often leads to unnatural additions (like “buh”), which hurts co-articulation later.
- Key Characteristics:
- VAKT Approach: Utilizes all sensory pathways: Visual (watching the mouth/mirror), Auditory (listening), Kinesthetic (feeling the movement of articulators), and Tactile (feeling breath stream or vocal cord vibration).
- Syllable Drills: Students practice consonant-vowel (CV) or vowel-consonant (VC) combinations (e.g., ba, be, bi, bo, bu).
- Transition to Words: Once syllables are mastered smoothly, they are blended into meaningful words.
- Best For: Students who need structured, physical feedback to understand how to move their articulators, and those who struggle with smooth co-articulation.
Association Phoneme Unit Method (McGinnis Method)
Developed by Mildred McGinnis, this is a highly structured, “bottom-up” approach originally designed for children with severe language disorders (aphasia) but widely adapted for the deaf.
- Core Philosophy: Speech must be broken down into its smallest components (phonemes) and systematically built back up, associating sound, articulation, and written symbols.
- Key Characteristics:
- Phoneme First: Teaches individual sounds in isolation first (e.g., /m/, /a/, /t/).
- Cursive Writing as a Visual Anchor: Uniquely uses written letters (often cursive, to show the continuous flow of sounds) paired with the spoken sound.
- Northampton Vowel Chart: Frequently utilizes specific phonetic spelling charts to help students visually categorize vowel sounds.
- Strict Sequence: Sounds are blended into nouns, then verbs, then simple sentences, with strict mastery required before moving on.
- Best For: Students with multiple learning difficulties, apraxia of speech, or those who have failed to learn via holistic or naturalistic methods.
Cued Speech
Invented by Dr. R. Orin Cornett in 1966, Cued Speech is not a sign language (like ASL), but rather a visual support system for spoken language.
- Core Philosophy: Many spoken sounds look exactly the same on the lips (e.g., /b/, /p/, /m/). Cued Speech provides a visual cue to make 100% of spoken language visually clear.
- Key Characteristics:
- The System: Uses 8 handshapes (representing consonant groups) placed in 4 different locations around the mouth/throat (representing vowel groups).
- Phonetic Representation: It represents the sound of the language, not the spelling.
- Combined with Lipreading: The listener watches the speaker’s mouth movements simultaneously with the hand cues to determine the exact phoneme being spoken.
- Best For: Improving phonemic awareness, boosting literacy/reading skills in deaf children, and providing visual access to spoken language in inclusive environments.
Auditory Verbal Therapy (AVT)
AVT is a highly specialized early intervention approach that strictly focuses on listening and spoken language, explicitly avoiding visual communication cues.
Best For: Infants and toddlers who have received early, highly effective amplification (especially Cochlear Implants) and have highly involved parents.
Core Philosophy: If a child’s brain is given access to sound early enough via modern technology, they can learn to listen and speak just like a child with typical hearing.
Key Characteristics:
Auditory-Only Focus: Therapists and parents often cover their mouths (using a “hand cue” or acoustic hoop) to prevent the child from lip-reading, forcing the brain to rely solely on the auditory signal.
Parent as the Primary Teacher: AVT is heavily focused on parent-coaching. The therapist teaches the parent how to turn everyday routines into listening and spoken language lessons.
Mainstreaming Goal: The ultimate goal of AVT is for the child to be fully integrated into mainstream schools without needing sign language interpreters.
Acoustic Highlighting: Parents are taught to emphasize difficult sounds using pitch and pause, rather than visual pointing.
| Method | Primary Sensory Focus | Base Unit of Instruction | Key Characteristic |
| Auditory Global | Auditory | Whole Words/Sentences | Naturalistic, focuses on prosody and context. |
| Multisensory Syllable | Visual, Tactile, Auditory | Syllable (CV/VC) | Focuses on smooth co-articulation and physical feedback. |
| Association Phoneme | Visual, Auditory | Isolated Phoneme | Highly structured, uses written letters to anchor sounds. |
| Cued Speech | Visual (Supplemental) | Phoneme | Uses hand shapes to clarify ambiguous lip-reading. |
| Auditory Verbal (AVT) | Strictly Auditory | Varies (Follows typical dev.) | Parent-centered, explicitly removes visual cues to train the hearing brain. |
Introduction to Ling’s approach
Core Philosophy of Ling’s Approach
- Auditory Primacy: Ling believed that speech is best learned through the auditory channel. Optimal amplification (hearing aids or cochlear implants) is a non-negotiable prerequisite.
- Systematic Hierarchy: Speech skills must be taught in a logical, developmental sequence. A child cannot be expected to produce complex consonant blends if they have not yet mastered basic vocalization and breath control.
- Automaticity: Sounds must be practiced at the syllable level until they are completely automatic (requiring no conscious motor effort) before they can be effectively used in meaningful language.
- Ongoing Evaluation: The teacher/clinician must constantly assess the child’s phonetic repertoire to determine exactly which step to teach next.
The Dual-Level Model: Phonetic vs. Phonologic
Ling separated speech acquisition into two distinct, but connected, levels.
- The Phonetic Level (Motor/Physical):
- This is the mechanics of speech. It involves teaching the child how to physically produce sounds in isolation or in meaningless syllables (e.g., repeating ba-ba-ba or mee-mee-mee).
- The goal here is precise articulation, speed, and motor memory.
- The Phonologic Level (Meaning/Language):
- This is the application of speech. It involves taking the sounds mastered at the phonetic level and using them meaningfully in words, phrases, and conversations (e.g., using ba to say “ball”).
- Ling’s Rule: A sound is never targeted at the phonologic (word) level until it has been thoroughly mastered at the phonetic (syllable) level.
Ling’s Seven-Stage Sequence of Speech Development
Ling designed a strict 7-stage hierarchy for teaching speech. A child must master a stage before progressing to the next.
- Stage 1: Undifferentiated Vocalization
- Encouraging the child to use their voice freely, continuously, and on demand.
- Stage 2: Supra-segmentals
- Gaining voluntary control over breath, pitch, duration (long/short sounds), and intensity (loud/soft). This lays the rhythmic foundation for speech.
- Stage 3: Vowels and Diphthongs
- Vowels carry the “power” and volume of speech. All distinct vowels must be mastered because they form the core of every syllable.
- Stage 4: Consonants (by Manner of Articulation)
- Consonants are introduced in a specific developmental order based on how they are made (Manner) and where they are made (Place).
- They are always taught in combination with vowels (e.g., bee, bah, boo), never as isolated clicks or hisses.
- Order of acquisition: Nasals (/m/, /n/) and Plosives (/b/, /p/) are generally taught before Fricatives (/s/, /sh/).
- Stage 5: Consonant Blends
- Combining multiple consonants together (e.g., /st/ in stop, /pl/ in play).
- Stage 6: Meaningful Words (Phonologic Transfer)
- The child begins actively using their mastered sounds to name objects, request items, and express thoughts.
- Stage 7: Complex Conversational Speech
- The ultimate goal: using all speech sounds automatically and accurately in spontaneous, rapid, everyday conversation.
The Ling 6-Sound Test
One of Dr. Ling’s most famous and practical contributions is the 6-Sound Test. It is a daily, quick behavioral check used by parents and educators to ensure a child’s hearing technology is working and that they have access to the entire speech spectrum.
The Sounds (from lowest frequency to highest):
- /m/ (as in mouse): Tests very low frequencies.
- /oo/ (as in boot): Tests low frequencies.
- /ah/ (as in car): Tests mid frequencies.
- /ee/ (as in see): Tests mid-to-high frequencies.
- /sh/ (as in shoe): Tests moderately high frequencies.
- /s/ (as in sun): Tests the very highest speech frequencies.
How it is used:
- The adult covers their mouth (so the child cannot lip-read) and says the sounds in a random order at a normal conversational volume.
- The child must detect (indicate they heard it) and identify (repeat the exact sound back).
- If a child cannot hear the /s/ sound today, but could yesterday, it immediately alerts the educator to a device malfunction, a dead battery, or an ear fluid issue.
Individual and group speech teaching – advantages and limitations
Individual Speech Teaching (One-on-One)
Individual teaching involves a 1:1 ratio between the speech-language pathologist (or special educator) and the student. It is typically conducted in a quiet, distraction-free clinical or pull-out room.
Advantages:
- Hyper-Personalization: The entire session, materials, and pacing are tailored strictly to the child’s specific Individualized Education Program (IEP) goals.
- Maximum Target Practice: The child gets 100% of the speaking opportunities, allowing for high-repetition “drill” work (crucial for severe articulation or apraxia).
- Immediate & Precise Feedback: The clinician can instantly correct physical placement (e.g., tongue position) or acoustic errors without interrupting others.
- Reduced Distractions: Ideal for students with severe ADHD, autism, or sensory processing issues who are easily overwhelmed in group settings.
- Privacy & Comfort: Highly beneficial for self-conscious or older students (e.g., a teenager who stutters or a post-lingual child adjusting to a cochlear implant) who may feel embarrassed practicing in front of peers.
Limitations:
- The Generalization Gap: The biggest flaw of 1:1 therapy. A child might produce perfect speech in a quiet room with an adult, but completely fail to use those skills in a noisy classroom with peers.
- Artificial Environment: It lacks natural social dynamics. Communication is inherently social, but 1:1 therapy often feels like a sterile “testing” environment.
- Resource Intensive: It is highly expensive and time-consuming, requiring vast staffing resources that many public schools or clinics cannot sustain.
- Lack of Peer Modeling: The child only hears the “perfect” adult model, missing out on the opportunity to learn by watching peers attempt and master the same skills.
Group Speech Teaching
Group teaching involves one educator working with a small group of students (typically 2 to 6). Groups are often constructed homogeneously (students with the same speech errors) or heterogeneously (students with different, but complementary, communication goals).
Advantages:
- Natural Pragmatic Practice: Group settings naturally require social communication skills (pragmatics) that cannot be authentically replicated 1:1. Students must practice turn-taking, maintaining eye contact, topic maintenance, and active listening.
- Peer Modeling: Children often learn best from observing other children. Watching a peer successfully correct a speech error can be highly motivating.
- Better Generalization: Because a group more closely mimics a real-world classroom or playground, the speech skills mastered here are much more likely to transfer to the child’s daily life.
- Increased Motivation & Fun: Group therapy allows for interactive, game-based learning. The social pressure and competitive/collaborative elements often keep children engaged longer.
- Efficiency: It is a more cost-effective and time-efficient use of the educator’s schedule, allowing more children to receive services.
Limitations:
- Reduced Individual Attention: In a 30-minute session with three kids, each child may only get 10 minutes of direct practice. This is insufficient for children at the very beginning stages of learning a new motor-speech skill.
- Behavioral Disruptions: One unregulated or highly disruptive student can derail the entire session, wasting the time of the other students.
- Pacing Mismatches: If the group is not perfectly matched, one child may become bored because the pace is too slow, while another becomes frustrated because the pace is too fast.
- Acoustic Clutter: For children with hearing impairment, multiple voices and the rustling of a group setting create background noise that can make listening and accurate speech perception very difficult.
The Eclectic (Hybrid) Approach: Best Practice
Modern speech-language intervention rarely relies exclusively on just one method. The most effective approach is a strategic progression:
- Establishment Phase (Individual): The child begins with 1:1 sessions to physically learn the new speech sound, requiring intense focus, motor repetition, and specific feedback.
- Generalization Phase (Group): Once the sound is mastered in isolation or simple words, the child transitions into group therapy to practice using the sound in natural, conversational, and social contexts.
Aids and equipments for development of speech: Auditory aids (speech trainer), Visual aids (mirror etc.), tactile aids (Vibrotactile aids), software etc.
Auditory Aids (Maximizing Residual Hearing)
Auditory aids are designed to amplify sound with high fidelity, specifically for focused speech-language therapy, isolating the target sounds from background noise.
- The Speech Trainer (Auditory Trainer):
- What it is: A high-powered, desktop amplification unit (often hardwired). It consists of a high-quality microphone for the teacher/therapist, a microphone for the student, and heavy-duty headphones for the student.
- How it works: It provides excellent sound quality and allows the clinician to independently adjust the volume, pitch, and output limits for the right and left ears based on the student’s specific audiogram.
- Speech Application: Used for intense auditory-verbal drills, allowing the child to hear their own voice clearly (auditory feedback) alongside the perfect model from the therapist.
- FM Systems (Frequency Modulation):
- What it is: A wireless system where the teacher wears a microphone and the signal is transmitted directly to a receiver attached to the student’s hearing aid or cochlear implant.
- Speech Application: While primarily a classroom listening device, it is crucial for speech development because it completely eliminates background noise and distance, delivering crystal-clear phonetic models to the child.
Visual Aids (Seeing the Speech)
When the auditory channel is compromised, vision becomes the strongest compensatory sense. Visual aids help the child see what they cannot hear.
- The Mirror:
- What it is: The most fundamental, traditional, and indispensable tool in a speech clinic.
- Speech Application: The therapist and child sit side-by-side facing a large mirror. The child can watch the therapist’s articulatory placement (lips, teeth, tongue position) and immediately compare it to their own reflection. It is critical for teaching visible sounds (e.g., /m/, /p/, /b/, /f/, /v/, /th/).
- Airflow Visualizers (Low-Tech Props):
- What they are: Simple items like tissue paper, feathers, ping-pong balls, or bubbles.
- Speech Application: Used to teach “manner of articulation.” For example, holding a tissue in front of the mouth to show the burst of air for a plosive /p/ (the tissue jumps) versus the continuous airflow of a fricative /h/ (the tissue flutters).
- Anatomical Models/Charts:
- What they are: 3D physical models of the mouth or 2D sagittal cross-section drawings.
- Speech Application: Used with older children to explain where the tongue needs to go for hidden sounds (e.g., showing the tongue tip touching the alveolar ridge behind the teeth for /t/ or /d/).
Tactile Aids (Feeling the Speech)
Tactile aids utilize the skin’s sensitivity to vibration and pressure to convey acoustic energy, turning sound waves into physical sensations.
- Vibrotactile Devices:
- What they are: Wearable electronic devices (worn on the wrist, chest, or fingers) that convert acoustic speech signals into distinct vibratory patterns on the skin.
- Speech Application: Excellent for teaching supra-segmentals (the rhythm, duration, and intensity of speech) and for helping the child distinguish between voiced and voiceless consonants (e.g., /z/ vs. /s/).
- Direct Tactile Feedback (The Therapist’s Hands):
- What it is: The manual placement of the child’s hands on the therapist’s face or neck, and then on their own.
- Speech Application:
- Vocal Cords: Placing the hand on the larynx (throat) to feel the “buzz” of voiced sounds versus the stillness of voiceless sounds.
- Nasal Cavity: Placing fingers lightly on the sides of the nose to feel the vibration of nasal sounds (/m/, /n/, /ng/).
- Breath Stream: Placing the hand in front of the mouth to feel the warm air.
Software and Computer-Based Aids (Digital Biofeedback)
Modern technology transforms abstract speech concepts (like pitch and volume) into interactive, visual data on a screen.
Speech Application: A child might have to hold a steady vowel sound (“ahhhhh”) to make a digital car drive across the screen. If their volume drops or fluctuates, the car stops. This gamifies breath control and phonation.
Visual Speech Displays (e.g., Visi-Pitch, SpeechViewer):
What it is: Clinical software that analyzes a microphone input and displays the acoustic properties of the voice in real-time.
Speech Application: If a child with hearing impairment speaks in a monopitch, the software translates their pitch into a line on a graph. The child is asked to make the line “go up the hill,” visually training them to raise their vocal pitch. It is highly effective for controlling volume, pitch, and voice quality.
Spectrogram Software:
What it is: Software (like Praat) that creates a visual representation of the spectrum of frequencies of a sound as it varies with time.
Speech Application: Used mostly for older students to target precise consonant production (e.g., making sure the visual “fuzziness” of their /s/ sound matches the therapist’s /s/ sound on the screen).
3D Articulation Animation Apps:
What they are: Tablet applications that feature transparent, 3D animated heads.
Speech Application: Allows the child to literally look “inside” the mouth to see the exact movement of the tongue, jaw, and soft palate for every single phoneme in real-time.
Gamified Biofeedback Apps:
What they are: Apps designed for young children where voice controls a game.
Role of family in stimulation of speech and language and home training
The Fundamental Role of the Family
The family is not just a support system; in the context of early intervention, the parents are the primary teachers.
- The Primary Language Models: Children learn language through immersion and imitation. The family provides the foundational linguistic blueprint.
- The “Natural” Environment: Clinical therapy happens 1–2 hours a week; home life happens 24/7. Skills learned in a clinic must be generalized at home to become permanent.
- Emotional Bedrock: A child’s willingness to take the risk of communicating (especially when it is difficult for them) depends heavily on a secure, encouraging, and highly responsive home environment.
- Advocacy and Case Management: Families act as the central hub coordinating between speech-language pathologists (SLPs), audiologists, special educators, and the school system.
Core Principles of Home Training
Home training should not look like a sterile clinic. It must be integrated into daily life.
- Routine-Based Learning: The best language stimulation happens during predictable daily routines (bath time, mealtime, getting dressed). Routines provide repetitive, predictable vocabulary.
- Incidental Learning: Capitalizing on unplanned, teachable moments. (e.g., If a child drops a spoon, the parent uses that moment to teach the word “Uh-oh,” “Drop,” or “Spoon”).
- Following the Child’s Lead: Instead of forcing a child to talk about a flashcard, the parent observes what the child is already playing with and attaches language to that specific interest.
- Consistency over Intensity: 10 minutes of focused, high-quality language interaction every single day is far more effective than a single 2-hour “cram” session on the weekend.
Specific Language Stimulation Strategies for Parents
SLPs actively coach parents to use these specific, evidence-based techniques at home.
- Self-Talk: The parent narrates their own actions out loud while the child is nearby.
- Example: “I am washing the dishes. Wash, wash, wash. Now I’m drying the big plate.” (Provides a constant bath of vocabulary).
- Parallel Talk: The parent narrates what the child is doing, acting like a sports broadcaster.
- Example: “You are pushing the red car. Vroom! The car goes fast.”
- Expansion (Grammar): The parent takes what the child says and repeats it back with the correct, adult grammar.
- Child: “Doggy run.” -> Parent: “Yes, the doggy is running!”
- Extension (Vocabulary): The parent takes what the child says and adds new semantic information to it.
- Child: “Big car.” -> Parent: “Yes, that is a big, shiny, blue car!”
- Communication Temptations (Sabotage): Purposefully arranging the environment so the child must communicate to get what they want.
- Example: Putting a favorite toy in a clear, tightly sealed plastic container and handing it to the child, prompting them to ask for “Help” or “Open.”
- The “Rule of 3”: For children with hearing impairment or severe delays, giving them extra time to process. Pause, look expectantly, and wait at least 3 to 5 seconds before jumping in to answer for them.
Modifying the Home Environment
The physical environment of the home heavily impacts speech perception and production.
- Acoustic Management: For children with hearing aids or cochlear implants, parents must actively reduce background noise (turning off the TV/radio when not actively watching, putting rugs down to reduce echoes) during critical conversation times.
- Visual Access: Ensuring the child can clearly see the speaker’s face. Parents must learn to get down to the child’s eye level rather than shouting instructions from across the room or from another floor.
- A Print-Rich Environment: Labeling items around the house and having highly accessible, age-appropriate books to bridge the gap between spoken and written language.
Challenges and Barriers for Families
While family involvement is the gold standard, educators and clinicians must recognize the realistic barriers families face.
- The Grief Cycle: Upon a diagnosis of hearing impairment or severe autism, parents often grieve the “typical” experience they expected, which can temporarily delay their ability to actively engage in training.
- Time and Burnout: Working parents often struggle to find the energy to turn every interaction into a “therapy moment.”
- Financial Stress: The cost of hearing aids, private therapy, and specialized materials can place immense strain on the family, shifting their focus from daily stimulation to financial survival.
- Lack of Confidence: Many parents feel unqualified to be “teachers.” A major role of the SLP/Educator is to empower the parent, reminding them that their natural bond with the child makes them the most qualified person for the job.
