Pinyin (拼音) is the official system that spells Mandarin sounds with the Latin alphabet. Every syllable = an initial (starting consonant) + a final (vowel ending) + a tone. The catch: several letters don’t sound like English — c = “ts”, q = soft “ch”, x = soft “sh”, and zh/ch/sh are made with the tongue curled back.
Pinyin is the on-ramp to Chinese: nail it and every new word is readable; fudge it and you bake mispronunciations into thousands of words. It looks like English but plays by its own rules. The whole Mandarin sound system is only about 400 syllables — learn the building blocks once and you can read any pinyin you ever see. Here’s how to actually read it.
The structure: initial + final + tone
Take 好 hǎo: the initial is h, the final is ao, and the tone is the 3rd (dipping). Almost every syllable fits this slot machine. There are about 21 initials and around 35 finals; combine them with the tones and you have the entire spoken inventory. Learn the pieces and pronunciation becomes mechanical.
The full set of initials
Most initials are easy because they resemble English. The table groups them so you can see the patterns — pay special attention to the bottom two rows.
| Group | Initials | Notes |
|---|---|---|
| Lips | b · p · m · f | Like English; p is more strongly puffed. |
| Tongue-tip | d · t · n · l | Like English; t is strongly puffed. |
| Back of mouth | g · k · h | h is raspier than English, from the throat. |
| Front (tongue near teeth) | j · q · x | j ≈ “j”, q ≈ soft “ch”, x ≈ soft “sh” — tongue forward. |
| Retroflex (tongue curled back) | zh · ch · sh · r | zh ≈ “j”, ch ≈ “ch”, sh ≈ “sh”, r ≈ “s in measure”. |
| Whistle (tongue at teeth) | z · c · s | z ≈ “ds”, c ≈ “ts”, s ≈ “s”. |
Aspiration: the b/p, d/t, g/k secret
In English, b and p differ by voicing. In Mandarin the pairs differ by aspiration — the puff of air. The first of each pair is unaspirated (no puff), the second is aspirated (big puff):
| No puff | Big puff |
|---|---|
| b (like English “spy” p) | p (like English “pie” p) |
| d (like “sty” t) | t (like “tie” t) |
| g (like “sky” k) | k (like “kite” k) |
| j / zh / z | q / ch / c |
Test it: hold a tissue in front of your mouth. The aspirated sounds (p, t, k, q, ch, c) should make it flutter; the unaspirated ones shouldn’t.
The tricky initials (memorize these first)
| Pinyin | Sounds like | Example |
|---|---|---|
| c | “ts” in “cats” | 菜 cài — dish |
| q | soft “ch”, tongue forward | 去 qù — to go |
| x | soft “sh”, tongue forward | 谢 xiè — thanks |
| zh | “j” with tongue curled back | 中 zhōng — middle |
| ch | “ch” with tongue curled back | 吃 chī — to eat |
| sh | “sh” with tongue curled back | 是 shì — to be |
| r | like the “s” in “measure” | 人 rén — person |
The big confusion is the two “ch-like” series: j/q/x are made with the tongue near the front teeth, while zh/ch/sh curl the tongue back. Drill them as pairs (qī vs chī, xī vs shī) and your ear separates them quickly.
The special “-i” that isn’t “ee”
One of the most common beginner mistakes: after zh, ch, sh, r, z, c, s, the letter i is not pronounced “ee”. It’s a buzzy, neutral sound made by just holding the consonant’s tongue position and adding voice. So:
- 是 shì is “shrr”, not “shee”.
- 吃 chī is “chrr”, not “chee”.
- 四 sì is “suh/ssz”, not “see”.
But after j/q/x and most other initials, i is “ee” — 七 qī really is “chee”. Same letter, two sounds, decided by the initial.
The finals worth special attention
| Final | Sounds like | Example |
|---|---|---|
| ü | “ee” with rounded lips | 绿 lǜ — green |
| e (alone) | “uh”, not “eh” | 喝 hē — to drink |
| ian | “yen”, not “yan” | 天 tiān — sky |
| iu | actually “iou” | 六 liù — six |
| ui | actually “uei” | 水 shuǐ — water |
| un | actually “uen” | 春 chūn — spring |
| ong | “oong”, rounded | 中 zhōng — middle |
To make ü: say “ee”, then round your lips as if to say “oo” without moving your tongue. It’s the vowel in French “tu” or German “über.”
Spelling shortcuts that fool beginners
Hidden rules
- ü → u after j, q, x, y (ju is really jü, qu is qü). But lü and nü keep the dots, because lu/nu are different syllables.
- iu is short for iou, ui is short for uei, and un is short for uen — say the hidden middle vowel.
- y and w are spelling helpers for syllables that start with i/u (i → yi, u → wu, ü → yu).
- The tone mark always sits on the main vowel, in priority order: a > o/e > i/u. For iu and ui, it goes on the last letter.
Common mistakes to avoid
- Reading c, q, x, zh as English. These are the four letters that catch everyone — learn them deliberately.
- Saying “shee” for shi. Remember the special buzzy -i after retroflex/whistle initials.
- Ignoring the tone mark. Pinyin without the tone is half a word — always read the diacritic.
- Treating pinyin as the goal. It’s the bridge to characters and to typing, not the writing system itself.
Don’t learn pinyin silently
Pinyin is a map of sounds, so it must be learned with your ears and mouth, not just your eyes. Listen to a native syllable, then imitate it; compare minimal pairs; record yourself and compare. Pinyin plus the four tones (covered in our tones guide) is the complete Mandarin sound system — about 400 syllables, all reusable across every word you’ll ever learn.
Master pinyin with audio + a tone trainer
Hanzijo pairs every syllable and word with native audio, color-codes all five tones, and includes pinyin and minimal-pair drills so the q/x/zh-ch-sh distinctions click. Read it, hear it, say it — then lock it in with SRS.
Learn Pinyin the Right Way — FreeA 1-week pinyin starter plan
| Day | Focus |
|---|---|
| 1 | The easy initials (b p m f, d t n l, g k h) + simple finals (a o e i u) |
| 2 | j q x vs zh ch sh — drill as pairs with audio |
| 3 | z c s + the special buzzy -i |
| 4 | ü and the compound finals (ian, iu, ui, un, ong) |
| 5 | The four tones on top of syllables you already know |
| 6–7 | Read & record 30 real words; review misses with SRS |
Frequently asked questions
Should I learn pinyin before characters?
Yes — pinyin gives you accurate pronunciation from day one and underpins typing Chinese on a phone or computer. Learn it alongside your first characters, not in total isolation.
Is pinyin the same as the actual writing system?
No. Pinyin is a romanization for pronunciation and input; real written Chinese uses characters (hanzi). Pinyin is the bridge, not the destination.
How many sounds do I actually need to learn?
About 400 base syllables, made from ~21 initials and ~35 finals. With the tones layered on, that covers all of spoken Mandarin — a surprisingly small, finite set.
How do I type the tone marks and ü?
You usually don’t type tone marks for input — Chinese keyboards take plain pinyin (and ü is often typed as “v”), then you pick the character. Tone marks mainly appear in learning materials like this one.