🔍 In short. The italian perception verbs are vedere, sentire, guardare, ascoltare, udire, osservare, notare, scorgere. When something you saw or heard is also an action, Italian usually links the second verb in the infinitive: vedo Annalisa scendere dal palcoscenico, sento Fulvio chiudere il portone, guardo accendere le luci. English likes the -ing form (“I see Annalisa coming down”); Italian likes the bare infinitive. There are two flavours of the same pattern, a polite alternative with che + indicative, and a few pronoun rules that surprise English speakers. By the end of this guide the italian perception verbs will feel as natural as devo or posso.
Once you spot the pattern, it pops up everywhere: backstage at the Teatro Grande in Brescia, in the kitchen, on the phone. This B1 guide walks through the two structures, the pronoun question (l’ho vista cantare or gliel’ho sentita cantare?), the che alternative, the past participle agreement, and finishes with a stage-door dialogue, a cheat sheet and a quiz.
Cosa impareremo oggi
👆🏻 Jump to section
- What the italian perception verbs are
- Two patterns: vedo Annalisa scendere or vedo scendere Annalisa
- Pattern 1: vedo scendere Annalisa (event in focus)
- Pattern 2: vedo Annalisa scendere (person in focus)
- The che alternative: ho visto Fulvio che chiudeva
- Pronouns: l’ho vista cantare and gliel’ho sentita cantare
- Past participle agreement: vista or visto?
- When the infinitive is passive: sento criticare il regista
- Three mistakes English speakers make
- Cheat sheet: italian perception verbs
- Dialog: stage door at the Teatro Grande
- Frequently asked questions
- Related guides
What the italian perception verbs are
Walk into the artists’ entrance of the Teatro Grande in Brescia an hour before curtain and you’ll notice that the italian perception verbs do most of the work in normal speech: vedo arrivare l’orchestra, sento accordare i violini, guardo Fulvio chiudere il portone. The core group is small and you already know every verb in it.
- vedere (to see): the main visual verb.
- guardare (to watch, to look at): deliberate, attentive.
- osservare, notare, scorgere (to observe, to notice, to make out): different degrees of focused attention.
- sentire (to hear, to feel): the main hearing verb in everyday speech.
- ascoltare, udire (to listen, to hear): slightly more formal cousins.
What all the italian perception verbs share is that they can be followed directly by another verb in the infinitive, with no preposition in between: vedo cadere la pioggia, sento bollire l’acqua, guardiamo provare il coro. Where English reaches for the -ing form (“I see the rain falling”), Italian reaches for the bare infinitive. That single habit, once it clicks, fixes one of the most persistent learner mistakes at B1.
Two patterns: vedo Annalisa scendere or vedo scendere Annalisa
With the italian perception verbs, the same content can be packaged two ways. Both are correct, both are common, and the choice depends on what you want to highlight.
- Vedo Annalisa scendere dal palcoscenico.
I see Annalisa coming down from the stage. (focus on Annalisa) - Vedo scendere Annalisa dal palcoscenico.
I see Annalisa coming down from the stage. (focus on the event)
The first version puts the person right after the perception verb and treats Annalisa as the entity in view, with her action filling in the picture. The second tucks the person after the infinitive and treats the whole event as a unit. Speakers often switch between the two without thinking, the way English switches between “I saw Annalisa go down” and “I saw Annalisa going down”. The italian perception verbs share this freedom and you should make use of it in writing.
🔍 One mental hook. Pattern 1 (vedo scendere Annalisa) treats the event as a single block, like a scene captured in one shot. Pattern 2 (vedo Annalisa scendere) names the person first and then describes what she does. Same picture, different framing.
Pattern 1: vedo scendere Annalisa (event in focus)
In the first version of the italian perception verbs, the infinitive comes right after the perception verb and the person (or thing) follows the infinitive. This pattern works best when the action is the main news of the sentence.
- Sento cantare il soprano dalla sala prove.
I hear the soprano singing from the rehearsal room. - Vediamo arrivare i musicisti dell’orchestra alle tre.
We see the orchestra musicians arriving at three. - Fulvio ha sentito provare l’aria della Tosca per tutta la mattina.
Fulvio heard the Tosca aria being rehearsed all morning. - Annalisa guarda accendere le luci del palcoscenico una a una.
Annalisa watches the stage lights being turned on one by one.
One detail that catches English speakers off guard: when the infinitive has its own direct object, the person doing the action takes the preposition a (or da): sento cantare l’aria ad Annalisa, guardiamo provare la scena ai cantanti. The italian perception verbs share this rule with the causative fare (faccio cantare l’aria ad Annalisa), and that family resemblance is the easiest way to remember it.
- Sentiamo cantare l’aria ad Annalisa, è bravissima.
We hear Annalisa singing the aria, she’s very good. - Ho visto chiudere il portone a Fulvio verso mezzanotte.
I saw Fulvio closing the door towards midnight. - Guardiamo provare la scena finale ai cantanti.
We watch the singers rehearsing the final scene.
Pattern 2: vedo Annalisa scendere (person in focus)
The second version of the italian perception verbs slots the person between the perception verb and the infinitive. This is the pattern you reach for when the person is the news, when you want the listener to lock onto who is doing the action before hearing what they do.
- Vedo Annalisa scendere dalla scala di servizio con il faretto.
I see Annalisa coming down the service stairs with the spotlight. - Fulvio sente i tecnici gridare in graticcia durante il montaggio.
Fulvio hears the stagehands shouting up in the grid during the set-up. - Ho visto il direttore d’orchestra entrare in buca alle sette in punto.
I saw the conductor walking into the pit at seven on the dot. - Annalisa guarda Fulvio chiudere il portone dell’ingresso artisti.
Annalisa watches Fulvio closing the artists’ entrance door. - Sentiamo i cantanti scaldare la voce nei camerini al primo piano.
We hear the singers warming up their voices in the dressing rooms on the first floor.
Pattern 2 is the closer cousin of English. Vedo Annalisa scendere sits next to “I see Annalisa coming down” almost word for word, with the only difference being that Italian uses the infinitive where English uses -ing. With the italian perception verbs in this pattern, the second verb can also have its own object directly: sento Annalisa cantare l’aria, no preposition needed, because the person is already the object of sento.
🎯 Mini-task #1. Rewrite each sentence using both patterns of the italian perception verbs.
- I hear the soprano warming up.
- We see the conductor entering the pit.
- Annalisa watches the technician adjusting the spotlight.
- I heard the door closing at midnight.
👉 Show answers
1. Sento scaldare la voce al soprano / Sento il soprano scaldare la voce
2. Vediamo entrare in buca il direttore / Vediamo il direttore entrare in buca
3. Annalisa guarda regolare il faretto al tecnico / Annalisa guarda il tecnico regolare il faretto
4. Ho sentito chiudere il portone a mezzanotte / Ho sentito il portone chiudersi a mezzanotte
The che alternative: ho visto Fulvio che chiudeva
Alongside the infinitive pattern, the italian perception verbs accept a second construction: che + indicative. Instead of vedo Annalisa scendere you can say vedo Annalisa che scende; instead of ho sentito chiudere il portone you can say ho sentito Fulvio che chiudeva il portone. Both are standard Italian and natives use them interchangeably, with a slight tendency to prefer che + indicative in casual spoken sentences and the infinitive in tighter, more written-feeling sentences.
- Ho visto Fulvio che chiudeva il portone alle due di notte.
I saw Fulvio closing the door at two in the morning. - Annalisa ha sentito il regista che dava istruzioni al coro.
Annalisa heard the director giving instructions to the choir. - Vediamo gli orchestrali che accordano gli strumenti in buca.
We see the orchestra members tuning their instruments in the pit.
The tense inside the che clause must match the moment of perception. If you perceived the action live, the tense follows naturally: present after vedo, imperfect after ho visto or vidi. Vedo Annalisa che scende, ho visto Annalisa che scendeva. With the italian perception verbs the imperfect after a past tense is almost automatic because the perceived action was in progress while you were watching.
A useful detail: in the che construction, when you replace the person with an object pronoun, the pronoun sits before the perception verb, exactly where you’d expect it. L’ho visto che chiudeva il portone means “I saw him closing the door”. This is not the relative pronoun of il libro che ho letto; it’s a special perception structure, and trying to translate che as “who” or “that” will give you an awkward English sentence. Just hear it as “saw him closing”.
Pronouns: l’ho vista cantare and gliel’ho sentita cantare
Object pronouns with the italian perception verbs follow one rule that surprises most learners: the pronoun attaches to the perception verb, not to the infinitive. L’ho vista cantare, never ho visto cantarla. Gli sento suonare il violino, never sento suonargli il violino. Italian treats the perception verb and the infinitive as a unit and front-loads the pronoun onto the first verb.
- L’ho vista accordare il pianoforte prima del sound check.
I saw her tuning the piano before the sound check. - Li abbiamo visti uscire dall’ingresso artisti verso mezzanotte.
We saw them coming out of the artists’ entrance towards midnight. - Fulvio la sente cantare anche dal cortile interno.
Fulvio can hear her singing even from the inner courtyard.
Now the trickier case. When the infinitive has its own direct object, the person doing the action becomes an indirect object, so you switch to an indirect pronoun. Instead of l’ho sentita cantare l’aria (where la is Annalisa), if you also want to pronominalise the aria you get gliel’ho sentita cantare (“I heard her sing it”). The gli- is the indirect pronoun standing in for Annalisa, the la is the direct pronoun standing in for the aria, and the past participle still agrees with the aria (sentita, feminine because la = l’aria). This combination shows up constantly in real speech and is worth drilling.
- Gliel’ho sentita cantare due volte di seguito.
I heard her sing it two times in a row. - Glielo guardo controllare ogni sera all’ingresso.
I watch him check it every evening at the entrance. (it = the list) - Gliele abbiamo sentite provare per tutta la settimana.
We heard him rehearse them all week long. (le = the arias)
The simpler rule of thumb for the italian perception verbs: if the infinitive has no object of its own, just use a direct pronoun (l’ho vista, li abbiamo sentiti). If it has its own object that you also want to pronominalise, switch to the combined indirect plus direct (gliel’ho vista fare, gliele abbiamo sentite cantare). And the pronoun always lands on the perception verb, never on the infinitive.
Past participle agreement: vista or visto?
When the italian perception verbs appear in the passato prossimo with a pronoun in front, the past participle agrees with that pronoun in gender and number, exactly like in any other compound tense with avere. Ho visto Annalisa becomes l’ho vista. Ho sentito i tecnici becomes li ho sentiti. The rule is mechanical and predictable.
- L’ho vista scendere dal palcoscenico. (la = Annalisa)
I saw her come down from the stage. - Li ho sentiti applaudire dalla sala. (li = il pubblico, gli spettatori)
I heard them applauding from the hall. - Le abbiamo viste accordare i violini. (le = le violiniste)
We saw them tuning the violins. - L’ho visto entrare in buca. (lo = il direttore)
I saw him entering the pit.
One word of warning: in everyday speech you’ll sometimes hear the participle stay neutral (l’ho visto cantare referring to a woman, instead of vista). Italians understand it, but in writing and in exams the agreement is required when a direct pronoun precedes the verb. The italian perception verbs sit firmly inside this rule. The opposite case, agreement with a full noun that follows the verb (ho vistad Annalisa), is wrong: agreement only kicks in when the pronoun is already in front.
When the infinitive is passive: sento criticare il regista
One last twist with the italian perception verbs: when the second action is passive in meaning (“I hear the director being criticised”, “I see the trees being cut down”), Italian keeps the infinitive in the active form and lets context do the work. Where English forces a passive (“being criticised”), Italian says simply sento criticare il regista and the meaning is unambiguous because the regista is what gets criticised, not what does the criticising.
- Sento criticare il regista in sala prove.
I hear the director being criticised in the rehearsal room. - Ho visto montare la scenografia in tre ore soltanto.
I saw the set being built in just three hours. - Sentiamo applaudire il soprano da tutta la galleria.
We hear the soprano being applauded by the whole gallery. - Ho sentito intonare l’inno dal coro al piano terra.
I heard the anthem being sung by the choir on the ground floor.
If you want to name who does the action, you add it with da: sento applaudire il soprano dal pubblico, ho visto montare la scenografia dagli operai. The italian perception verbs handle this elegantly: no passive infinitive needed, no clunky construction, just active infinitive plus da plus agent. It is one of the cases where Italian is shorter and tighter than English, and noticing it will make your sentences feel more native.
Three mistakes English speakers make
Three slips with the italian perception verbs flag a B1 sentence as written by a learner. Fixing them is quick.
Mistake 1. Using the gerund. Wrong: vedo Annalisa cantando. Italian almost never uses the gerund after the italian perception verbs; it uses the infinitive instead. Correct: vedo Annalisa cantare. The gerund-after-perception habit is a direct calque from English -ing, and natives spot it instantly. The same applies to sentire, guardare, osservare: always infinitive, never gerund.
Mistake 2. Attaching the pronoun to the infinitive. Wrong: ho visto cantarla, ho sentito chiudergli il portone. The pronoun goes on the perception verb. Correct: l’ho vista cantare, gli ho sentito chiudere il portone. The italian perception verbs lock the pronoun to themselves and refuse to let it slide onto the infinitive.
Mistake 3. Forgetting past participle agreement. Wrong (in writing): l’ho visto cantare when referring to Annalisa. Correct: l’ho vista cantare. Whenever a direct pronoun (lo, la, li, le) precedes the italian perception verbs in the passato prossimo, the participle must agree in gender and number. Le abbiamo viste, li abbiamo sentiti, l’ho vista.
🎯 Mini-task #2. Fix or confirm each sentence about the italian perception verbs.
- Vedo Annalisa cantando l’aria di Tosca.
- Ho sentito chiuderlo a Fulvio.
- L’ho visto entrare in buca alle sette. (lo = il direttore)
- Le abbiamo sentito provare per tutta la mattina. (le = le cantanti)
- Gliel’ho sentita cantare due volte. (gli = ad Annalisa, la = l’aria)
👉 Show answers
1. Vedo Annalisa cantare l’aria di Tosca (no gerund after perception verbs)
2. Gliel’ho sentito chiudere (pronoun on perception verb, indirect for Fulvio plus direct for portone)
3. ✓ correct (lo = direttore, participle agrees masc. sing.)
4. Le abbiamo sentite provare (participle agrees with le, fem. plural)
5. ✓ correct (gli = Annalisa as indirect, la = aria, participle agrees with la)
Cheat sheet: italian perception verbs
One table, the whole system. Keep it open while you build your next sentence with the italian perception verbs.
| Question | Answer | Example |
|---|---|---|
| The core perception verbs | vedere, guardare, osservare, sentire, ascoltare, udire, notare, scorgere | vedo, sento, guardo + infinitive |
| Pattern 1 (event in focus) | perception verb + infinitive + person | vedo scendere Annalisa |
| Pattern 2 (person in focus) | perception verb + person + infinitive | vedo Annalisa scendere |
| Pattern 1 with own object | person takes a / da | sento cantare l’aria ad Annalisa |
| Alternative with finite verb | che + indicative | ho visto Fulvio che chiudeva |
| Pronoun position | always on perception verb, never on infinitive | l’ho vista cantare, gli sento suonare |
| Pronoun when infinitive has object | indirect for person plus direct for object | gliel’ho sentita cantare |
| Past participle agreement | with direct pronoun in front | l’ho vista, li ho sentiti |
| Passive sense | active infinitive plus da plus agent | sento criticare il regista dal pubblico |
| What NOT to do | no gerund, no preposition before bare infinitive | NOT vedo Annalisa cantando |
Dialog: stage door at the Teatro Grande
Annalisa, the lighting technician at the Teatro Grande in Brescia, runs into Fulvio, the long-time portinaio at the artists’ entrance, an hour before the opening night of Tosca. Watch every perception verb at work.
👩🏼🦰 Annalisa: Fulvio, hai visto entrare il direttore? Doveva essere qui per le sei.
👨🏻🦳 Fulvio: L’ho visto passare dieci minuti fa, è già in buca. Lo sento provare l’attacco del secondo atto adesso.
👩🏼🦰 Annalisa: Bene. E il soprano? Ieri sera ho sentito cantare in sala prove fino a tardi.
👨🏻🦳 Fulvio: Era lei, sì. L’ho vista uscire dal camerino verso le undici, sembrava stanca. Oggi è arrivata alle quattro per il riscaldamento.
👩🏼🦰 Annalisa: Allora la sentiremo cantare bene stasera. Senti, hai visto i tecnici dell’audio? Devo regolare i microfoni del palco.
👨🏻🦳 Fulvio: Sono saliti in graticcia mezz’ora fa. Li ho visti portare su due casse nuove, mi sa che hanno cambiato qualcosa nel mixer.
👩🏼🦰 Annalisa: Speriamo che funzioni tutto. L’altra sera ho sentito fischiare un microfono per tutto il primo atto, una sofferenza.
👨🏻🦳 Fulvio: Anch’io. Stavo controllando i biglietti all’ingresso e sentivo gli spettatori che si lamentavano. Mai sentito niente di simile in trent’anni di portineria.
👩🏼🦰 Annalisa: Ne abbiamo parlato col direttore tecnico stamattina. Glielo ha fatto sentire al fonico e adesso ha cambiato due cavi. Speriamo.
👨🏻🦳 Fulvio: Bene. Vado a chiudere il portone laterale prima che entri qualcuno senza pass. Ci vediamo all’intervallo, ti porto un caffè dalla buvette.
👩🏼🦰 Annalisa: Grazie Fulvio. Se senti suonare il campanello dell’allarme tecnico, vienimi a cercare in cabina luci.
Count the italian perception verbs in the exchange: hai visto entrare, l’ho visto passare, lo sento provare, ho sentito cantare, l’ho vista uscire, la sentiremo cantare, hai visto, li ho visti portare, ho sentito fischiare, sentivo gli spettatori che si lamentavano, mai sentito, glielo ha fatto sentire, senti suonare. Every pattern from the cheat sheet appears at least once in a single backstage conversation.
🎯 Mini-challenge. Describe a real scene from the last twenty-four hours in five Italian sentences, using each of the italian perception verbs at least once: vedere, sentire, guardare, ascoltare, osservare. Mix Pattern 1 and Pattern 2. Read it out loud once.
Test your understanding
Take the quiz below to test what you’ve learned about the italian perception verbs.
–
§
Frequently asked questions
Six questions about the italian perception verbs come up in every B1 cohort. For verb construction patterns more broadly, the Accademia della Crusca note on valenze e reggenze dei verbi is the standard reference.
What is the difference between ‘ho visto Annalisa mangiare’ and ‘ho visto Annalisa che mangia’?
Both are correct and natives use them almost interchangeably. The infinitive version (ho visto Annalisa mangiare) is slightly tighter, more written-feeling, and treats the event as a single block. The che plus indicative version (ho visto Annalisa che mangia) is the slightly more casual, more spoken option, and emphasises that the action was in progress when you saw it. With past tenses, the che clause usually takes the imperfect: ho visto Annalisa che mangiava. The italian perception verbs accept both constructions with no change in meaning.
Where does the pronoun go: ‘l’ho visto fare’ or ‘ho visto farlo’?
Only l’ho visto fare is correct. With the italian perception verbs the object pronoun must attach to the perception verb, never to the infinitive. So you say l’ho vista cantare (not ho visto cantarla), li ho sentiti uscire (not ho sentito uscirli), gli sento suonare il violino (not sento suonargli il violino). The perception verb and the infinitive behave as a unit and the pronoun loads onto the front. This is different from modal verbs, where both positions are allowed.
When do I say ‘gliel’ho sentita cantare’ instead of ‘l’ho sentita cantare’?
You use the combined pronoun gliel’ho when the infinitive has its own direct object that you also want to pronominalise. If you say l’ho sentita cantare, the la refers to the person (Annalisa) and there is no second object. If the aria becomes a pronoun too, the structure becomes gliel’ho sentita cantare: gli is the indirect pronoun for Annalisa (because she is now the doer of an action with its own object), and la is the direct pronoun for the aria. The past participle still agrees with la, so sentita is feminine. Tricky, but very common in spoken Italian.
Does the past participle agree? L’ho vista cantare or l’ho visto cantare?
L’ho vista cantare is the form expected in writing and exams. Whenever a direct object pronoun (lo, la, li, le) sits in front of the italian perception verbs in the passato prossimo, the past participle agrees in gender and number with that pronoun. So: l’ho vista (Annalisa), l’ho visto (il direttore), li ho sentiti (i tecnici), le ho sentite (le cantanti). In casual speech you will sometimes hear the neutral l’ho visto referring to a woman, but it is not the form to use in careful Italian.
Can I say ‘sento cantare l’aria DA Annalisa’ or only ‘A Annalisa’?
Both are possible with the italian perception verbs and they are nearly synonymous. Sento cantare l’aria ad Annalisa puts the focus on Annalisa as the singer, the way the causative fare works: faccio cantare l’aria ad Annalisa. Sento cantare l’aria dad Annalisa is more event-focused, almost passive in feeling, similar to English the aria sung by Annalisa. Native speakers often pick a depending on whether the listener already knows who the singer is, and da when introducing the agent for the first time. Both are correct standard Italian.
Why is ‘vedo Annalisa cantando’ wrong?
Because the italian perception verbs do not take the gerund. The mistake comes from translating the English -ing form directly. English uses two options after see and hear (the bare infinitive: I saw her sing; or the -ing form: I saw her singing). Italian uses only the infinitive: vedo Annalisa cantare or vedo cantare Annalisa, never vedo Annalisa cantando. The gerund in Italian has its own roles (stare plus gerund for ongoing action, gerund clauses for cause or time), but it never follows a perception verb. This is one of the cleanest signs that you have graduated from beginner to B1 thinking.
Ready for the next step?
All our classes are live on Zoom with a native Italian teacher, in small groups. If this lesson matches your level, take it further with real practice.

Milano A2-B1
Small group course · live on Zoom · native teacher
Move from the basics to real conversations, step by step, with a native Italian teacher who keeps the group small and the pace right for you.
- Small groups, max 4 students — weekly live Zoom lessons
- Grammar, vocabulary, listening and writing in every cycle
- Materials in Italian + English, beginner-friendly
- Homework after each lesson, corrected by your teacher

Individual classes
One-to-one · any level · live on Zoom
Private lessons with your dedicated native Italian teacher, fully tailored to your goals and schedule, from absolute beginner to advanced.
- 55-minute individual Zoom lessons, your dedicated teacher
- Personalised level assessment included
- Interactive online materials — homework after each lesson
- Flexible weekly schedule or pay-as-you-go package
Related guides
Three guides that pair with the italian perception verbs, plus an institutional reference on the construction.
- Italian Modal Verbs: Dovere, Potere, Volere, Sapere: the parallel pattern of verb plus bare infinitive.
- Italian Pronouns with Modal Verbs: the rules for pronoun position across verb chains.
- Italian Posso vs Riesco: A2 foundations on potere and riuscire plus infinitive, useful before this guide.
- Accademia della Crusca: valenze e reggenze dei verbi: the institutional reference on verb construction patterns in Italian, including perception verbs plus infinitive.





