Perception and Cognition
Eye movements:
Illusions: When we perceive things different from what they actually are. Can be seen in static images, in space, and in time
Cross-displinary approach:
Cognition: Human understanding of their environment (thoughts)
Perception: Human experience their environment (senses)
Window: A channel to collect information-sensory and processing system.
Paradigms of information processing
Object and Face Perception:
Central scientific approach: acquisition, processing, storage, recall of data in the human brain
Perceptual filter: Channel tuning (size tuning)
click to edit
Transferring across sensory modalities
click to edit
sensory substitution: replacing a (lost/missing) sense with some other sense!
(one in every 25,000 people !)
synaesthesia: a mixing of senses causing a person to experience such things as colored hearing, gustatory sights, and auditory smells...
ventriloquism: speaking or uttering sounds so that they seem to come from the speaker’s dummy or some other source than the speaker
Sensory modalities
OTHER : magnetoreception? electroception? ultrasound?
PLUS : temperature ? pain ? balance ?
FIVE SENSES : sight, hearing, touch, smell, taste
Theories of perception: Gesalt theory-estalt theory has its focus on the principles of perceptual organisation: ‘the whole is more than the sum of the parts’ ‘laws’ describing perceptual phenomena <are these explanations?> (Wertheimer, Koffka, Kohler)
click to edit
praegnanz: good shape • similarity, symmetry
• proximity
• smooth continuation
• closure
• common fate
Direct perception: emphasizing bottom-up processing, exploiting the richness of information contained in sensory data, direct use for behavioral control without the need for high-level representation
click to edit
comprehensive capture of information in optic array
• unambiguous information about spatial layout (flowfield)
• object affordances: meaning/usabilit in behavioural context
• resonance: process to extract information (~ filter tuning
Constructivist approach: Emphasizing top-down processes in perception to resolve ambiguities the mind tries to make the best sense of limited, noisy data (Neisser, Gregory)
click to edit
active ‘construction’ of percepts
• iterative comparison of sensory input with internal (stored) knowledge
• hypotheses & expectations generate specific errors (illusions)
• physiological basis ?
• revival: Bayesian estimation
Information processing: neuro-scientific & computational approach to perception: first information processing steps as a basis for cognitive psychology.
click to edit
receptor: transform stimuli to neural signals
• receptive field: localisation and tuning
• filter: encoding of information efficiently
• representation: cortical processing and mapping • illusions: inherent misrepresent of physical world • active sensing: intelligent search system strategies
Properties of an image
spatial vision
two-dimensional
static
basic properties: brightness
& colour & patterns • but we ‘see’ much more...
• flicker, movement
Black and white coloration: Amount of light> objective luminance> subjective brightness
interpret visual system as measurement device (like a ruler) :
how many shades of grey can you discriminate? < 5,000 perceptual shades of grey (8-bit computer display has no more than 256 grey levels)
click to edit
Colour : small number linguistic categories: different number of colour names in different cultures (Berlin & May 1969)
perceptual colour space: arrange colours
objective ‘
brightness’
according to similarity: Colour circles
measure discrimination thresholds
3 M different colours
(24 bit computer display ~ 16 M)
Contrast:the central square at the top looks brighter than the square at the bottom >> brightness contrast illusion. The centre colour stays the same, however the outer colour is different thus creating this illusion.
‘filters’ in the visual stream subtract stimulus intensity (lightness) in surround region (red region) form lightness in the centre (green region)
The balance between centre and surround is attributed as perceived brightness to the centre
The important perceptual measure is not absolute but relative stimulus intensity, in comparison to its surround: (simultaneous) contrast
The process of taking in information:the outside world is represented in retinotopic maps of neurons with centre-surround receptive fields = ‘filters’
Spatial filters with opponency receptive fields > enhance contrast, reduce redundancy (areas of no change) > image compression
Simultaneous contrast filters
The grey spots in the Hermann grid are believed to be the result of opponency filtering (contrast enhancement: excitation - inhibition)
Inside black squares: no stimulation of excitatory centre and inhibitory surround : no overall excitation >> perceived as dark
white bars: only small parts of the inhibitory surround are stimulated: excitation dominates inhibition >> bright
white intersections: larger parts of the inhibitory surround are stimulated: small overall excitation >> apparent reduction of brightness
Contrast illusion: enhancement of colour differences in space when presented next to each other
Colour assimilation: Bunker-white illusion- where object boundaries override simple colour contrast effects.
click to edit
After effect of illusion- Successive colour contrast
Adaptation :
• green-red stimulus > response jumps up and then gradually returns to resting levels.
• NOTE: this encodes only changes (in time)
Opponency :
• red-green are subtracted from each other
• NOTE: this is contrast enhancement
Perceived aftereffect:
Opposite in time = successive colour contrast
• red-adapted regions turn green
• green-adapted regions turn red
• NOTE: contrast enhanced in the time domain
click to edit
Depth-dimension, three cues:
Cue 1:Pictorial depth-a wide range of depth information can be directly extracted from a static monocular (single eye) image.
The different depth cues
Size, perspective, occlusion
Motion parallax
Using two eyes
Texture, contrast, shading
Cue 2: apart from extending the visual field, the combination of information from the two eyes allows precise depth measurements through stereopsis
Stereopsis:Retinal projection of object on opposite sides of the fovea (disparity δ) indicates its depth relative to the plane of fixation. stereopsis is exploited to produce depth impressions in projected/printed pictures (Wheatstone stereoscope, red-green anaglyphs, lenticular cards, magic eye
Cue 3: Relative size/Ponzo illusion- Convergence of two lines, this suggest a perspective which puts the two horizontal at different apparent distances.
Real-world illusion- Ames room: Room with a wall built at an angle to give a perceived size of an object ( generally the same size as the same corresponding object) and then the second object is placed away from the viewer at the other end of the room. The slanted wall and placement of the objects will cause the observer to think that they are different sizes.
'We don’t just see features - we automatically put them together into meaningful parts of the scene' We do this effortlessly.
Detection and recognition: Deciding something is there and then drawing on our memory to recognise what it is.
Grouping: he brain has some automatic rules for deciding how to organize features:
what goes together to form an object
Contours for visual objects:
Luminance boundary: in regions of equal luminance excitation and inhibition cancel each other
at the boundary excitation and inhibition are not balanced and thus increase the relative difference of perceived brightness
We extract object as different spatial scales.
Mona-Lisa example:
Some neurons respond at low spatial frequencies – regions.
High spatial frequencies – details.
Shape & Shading: o recognize an object we have to have an idea of its 3D shape
Shading is very important to do this Texture/depth
Face perception separately:
Important part of our communication with each other - evolutionary context
Difficult problem that we solve very well
Extremely variable, yet highly constrained
Face detection: The first step of the process has to be the detection of a face, which requires extracting the features that all faces have in common
This could be achieved by simple “template matching”
Face recognition: Once a face is detected it can be further analysed to categorise it.
Attributions such as emotions can be used to assist in the process.
Challenges:
Discrimination
Generalization:
- lighting conditions
- varying context
- rigid and non-rigid
transformation
We have a tendency to see faces everywhere, one of the most common examples of Pareidolia, to make meaning
where there is none
Prosopagnosia
The specific inability to recognise familiar faces”
•Usually their object recognition is unimpaired
•Right anterior inferior occipital lesions in the region of the occipital-temporal junction
Object Centred vs Viewer Centred
Some neurons responded in view independent way
Neurones in STS responded to facial motion, posture, and eye gaze
Neurons in inferior temporal cortex more likely to respond on the basis of identity
Photofit theory (Penry & Ryan, 1971)
Photographed features to allow subject to build a face.
Composite face effect: Composite faces interfered with recognition of a half face – the effect disappeared upside down. The holistic processing obliges us to combine information automatically.
Thatcher illusion: In the upside down faces we appear to process the features separately and don’t see the strange relationship between them. The holistic processing in the upright faces makes the unusual arrangement instantly salient.
The problem with eye movements:
Reasons for eye movements:
Types:
Measuring eye-movements:
Gaze shifting (orienting) mechanisms - Voluntary
Gaze stabilising mechanisms - Involuntary
Smooth pursuit: We use this to track a moving object, It ensures that the light from the object stays focussed on the back of the eye. Requires a
continuous feedback loop – the eye needs to adjust to the perceived position of the object.
Saccades: Fast, ballistic movements.
Vergence: We move our eyes to focus on objects at different distances away from us.
Fast, ballistic movements
Speeds of up to 700°/s
We usually makes saccades 3-4 times a second.
They show characteristic patterns of acceleration.
Both eyes move together in an almost identical fashion.
Vestibulo-ocular reflex - eye stabilization in space.
These are movements to cancel out the motion of the body and the head – they are in reaction to the signals from the vestibular organs of the inner ear (that give our sense of balance).
They happen for example when we are walking or spinning.
Optokinetic reflex - eye stabilization on the visual scene: These stabilising movements are made in reaction to the whole visual field moving
They can be studied by using rotating drums or tumbling rooms.
Eye tracker (main method): This involves a small camera focused on the eye and software that tracks where the pupil moves. By calibrating the pupil position against fixed points on the screen one can work out where the participant is looking.
Fixation measures:
Using this kind of technology we can build a picture over time of where a participant was looking.
We can measure: dwell time (each fixation), frequency of fixation, total duration of fixations (total looking time), order of fixation positions.
Stabilisation:
Adaption and fading:
It is thought that this is why eye movements evolved.
To build up a visual representation the image on the back of the eye needs to be kept still.
Stabilisation occurs through optokinetic and vestibulo-ocular reflex.
If an image is kept fixed on the retina it starts to fade away in the periphery after a while.
This is called Troxler’s fading (1804!)
It is caused by adaptation of the neurons in the retina, they do not fire after a while to the same stimulus.
Focusing of the fovea:
The retina is not uniformly sensitive.
A central part called the fovea has a high density of photoreceptors.
Most of the processing power is given to this area.
In order to have a detailed representation of a part of the image the light from that part needs to land in the fovea.
The brain needs to know which part of the motion on the retina is caused by eye movements and which part is caused by something moving in the scene.
• This can be done by cancelling out the signal using an “efference copy” from eye muscles.
• Also, cues from the scene, such as whole image motion can be cancelled out.
Passive vs active vision:
Passive: The passive approach sees the eye as just receiving information from the outside world.
Active: The active approach emphasis the dynamic process of the sampling of the visual scenes with reference to the role of the observer in guiding eye movements to relevant parts of the scenes.
Combining motor and visual systems: Brain areas involved in eye movements are active both when visual information is being processed and when eye movements are being made. These are important “cross-modal” areas.
Reading:
We do not fixate every single word.
Regressions account for 10% of saccades.
We easily move our eyes down to the next line.
Using the ‘moving window’ technique it has been shown that we use information from up to 15 letters away in English (less for more complex characters such as Chinese characters only 2-3).
This suggests shape of word plays a role as we are worse at resolving letters outside the fovea.
Eye and attention:
We are not able to process all visual information at once, we are only aware of a small part at any given time.
Usually where we are looking is where we are paying attention to (although not always).
The question is what makes us select something and how do we achieve this selection. How are eye movements and attention linked?
Theories on Eye movements: Active vision emphasises the role of eye movements linked with attention in real-life tasks.
Attention:Taken notice of someone or something; the regarding of someone or something as interesting or important.
We need attention because many stimuli influence different mechanisms, such as recalling memories, planning, decision making, and different parts of our brain. So we can isolate different stimuli, so we can focus, memorize, sort, and understand solely on what we select.
ATTENTION is a crucial element of information processing in the human brain/mind
It sits in the middle of a bottom-up processing from perception to memory, top-down feedback loop.
Orientating (directing) attention: Invalid cue : increase response time, error rate.
Valid cue : decrease response time, error rate. cueing enhances attention, improves decision.
Covert and Overt attention:
Covert: Paying attention without moving the eyes.
Overt: Selectively processing one location by
moving the eyes to that location of interest.
Neuralpsychology- hemi spatial neglect: attentional deficit one half of the visual scene is simply ignored.
Often caused by stroke, typically interrupting blood flow to right parietal lobe – critical in attention and some brain scan of patient suffering from left hemifield neglect.
Affects eating, dressing, reading, navigation, line bisection, cancellation, copying etc.
Change blindness: visual representation is not continuous.
Saccades: We move our eyes approximately 3 times per second > saccadic, suppression.
Changes to a scene that are made during blinks and saccades are difficult to spot.
Eye blinks: Each time we blink our eyes the visual scene is disrupted.
Missing change in natural scenes: W e can miss large changes evereyday: inattentional blindness. "The door study"
Cross-modal interactions: Cross-modal information is crucial for social attention.
The Secrets of Magic:
The Secrets of Magic
• sleight of hand : palming a card
• misdirection : diverting peoples’ attention away from an important event
• memory distortions: false memory altering spectators’ recollection of an event • perceptual distortions e.g. visual illusions
Hearing and language: Sound source is emitting (repeated) circular pressure waves (shells of air compression)
Frequency masking: An auditory phenomenon that occurs when two similar sounds play at the same time, or in the same general location. One masks the other, confusing your perception of either sound.
The ear: Acts as transducer : sound waves > neural signals
Sounds signal events > alarm system emotional balance: listening to music, screeching
communication.
Properties of sound waves: A pure tone is represented by a
sinewave (air pressure as function of space/time)
Amplitude and frequency (1/period, measured in Hz = cycles per second).
Corresponding to perceived loudness and pitch.
E.g musical scales: keys are arranged on the keyboard in the order of rising frequency of the musical tone generated
Harmonic intervals are determined by characteristic frequency ratio.
Harmonics: Frequency & amplitude of pure tones > perceived pitch and loudness. Musical tones are combinations of pure tones : fundamental (determines pitch) + harmonic frequencies (determine timbre)
Middle ear: impedance matching, overload protection.
Inner ear: Frequency analysis, neural encoding.
Outer ear: Directorial microphone.
Increase intensity of mask tone ... until target is no longer audible : ‘threshold’
Vary frequency of masking tone to determine threshold for different frequencies.
Perceived loudness: perceived loudness is measured by comparing successively presented tones (of different frequency) > decide which one sounds louder (2IFC).
intensity of comparison tone is adjusted until the same ‘subjective’ loudness as matched to the reference tone.
Physical intensity (SPL = sound pressure level) is recorded as ‘perceived loudness.
Equal loudness contours are determined by matching the perceived intensity of tone pairs at various base intensities
Human speech:
Consonants cover almost the entire range.
Vowel sounds are mainly in the lower frequency region.
Hearing impairment:
Noise exposure can lead to temporary threshold shifts (auditory fatigue) and permanent (partial) deafness.
Tinnitus: continuous humming or ringing >> leads to suppression.
Presbycusis: selective high- frequency hearing loss with age (ongoing).
Systems used in speech processing: Heard word-Auditory analysis system-->Auditory Input Lexicon--> Semantic system.
Speech- Phoneme Response Buffer-->Spoken Word Lexicon--> Semantic System.
Sound localisation: Location needs to be calculated from a number of cues.
Inter-aural processing to find azimuth (left-right) of sound source :
intensity differences (I I D) : acoustic ‘shadow’ of the head.
Pinnae : crucial for sensation of space (earphones); locate elevation (up-down).
Temporal or phase differences (I T D): inter-aural delays of 10 - 650 μsec
Cocktail party affect: The cocktail party effect is the phenomenon of the brain's ability to focus one's auditory attention on a particular stimulus while filtering out a range of other stimuli.
Binaural unmasking : spatial distance and difference in frequency support separation < difference between ears.
High-level effects (attention, familiarity of voice, language) & sensory fusion.
Masking: the detection of a tone is impaired if another tone or noise is presented at the same time
masking depends on proximity in space and similarity in frequency composition.
Memory systems:
Basis of cognition:
Perception involves memory: Recognition = sensation + prior knowledge.
Knowledge = Storage of past experiences/learning.
Memory allows us to plan and do things in the absence of sensation.
Processes:
Storage: Storing the newly encoded information in memory.
Retrieval: Recovery of previously stored information.
Encoding: Registering new information.
Levels-of-Processing theory
(Craik and Lockhart, 1972).
Rehearsal: Process of encoding memory from short-term to long-term memory.
Maintenance rehearsal:repeating items over and over to maintain them in short-term memory.
Elaborative rehearsal: An encoding strategy to facilitate the formation of memory by linking new information to what one already knows.
Perceptual and comprehension processes leave behind a memory trace.
The “deeper” an item is processed, the better it’s retained.
Important thing to study is not the inherent properties of memory systems, but rather the nature of encoding processes.
memory systems, but rather the nature of encoding processes
Support: Craik and Tulving (1975)
Presented a list of words, Each word could be associated with one of three types of encoding. Imagine one of the words was MEAL...
Three encoding conditions:
- Structural (Orthographic): Is the word in upper case? (Yes)
- Phonological: Does the word rhyme with “mat”? (No)
- Semantic: Does the word fit in this sentence: “the man ate his _” ? (Yes)
Terms:
Retention:
Levels of processing:
Type of processing:
Phonological.
Semantic.
Structural.
Deep
Shallow.
Poor
Good
Encoding-Specificity Principle (Tulving & Thomson, 1973): The likelihood of retrieval depends on the overlap between cues present at encoding and retrieval.
Evidence:
Which of the following cues will be more effective for recalling the word PIANO? – 1)somethingmelodious?,or
– 2)somethingheavy?
• Depends on the learning context (Barclay et al., 1974)
• Participants who learn in this context: The man tuned
the PIANO showed better recall with cue (1) than cue (2)
• Participants who learn in this context: The man lifted
the PIANO showed better recall with cue (2) than cue (1)
• Conclusion: Participants encoded words with their
context
Contextual cues: Inability to recognise a face when it is seen out of context is a common example of the importance of contextual cues.
Intrinsic: features that are integral to the stimulus.
Extrinsic: Other features present at time of encoding
(including one’s own cognitive state).
(Godden and Baddeley, 1975)
Asked divers to learn lists of words on land or underwater.
Recall was then tested either on land or underwater.
State-dependent recall: Recall is better if one’s internal state during recall mirrors one’s state during encoding.
Effect demonstrated for participants under influence of drugs, including:
Alcohol (Goodwin et al., 1969; Overton, 1972),
Caffeine (Kelemen & Creeley, 2001),
Nicotine (Kunsendorf & Wigner, 1985)
Marijuana (Eich et al., 1975)
Forgetting: Often reflects (possibly temporary) inability to access memory (i.e., retrieval failure) rather than a loss of memory.
Testing memory through recognition tests compared to recall tests often reveals that more is available than is necessarily accessible (e.g., Bahrick & Phelps, 1987).
Reasons:
Decay?Perhaps long-term memories gradually fade over time.
Decay theory suggests that rate of forgetting should be fixed over a given period of time, whatever the individual does in that time...
Retention interval is typically confounded with the number of additional experiences.
If forgetting is due to decay, then recall should simply depend on length of retention interval.
If forgetting is due to interference, then recall should depend on amount of similar learning within the retention interval (independently of its length).
According to decay theory, there should be more forgetting after longer retention intervals even though the amount of new information remains constant.
According to interference theory, there should be more forgetting after more new learning even when length of retention interval remains constant.
Proactive Interference: Previous learning disrupts later learning.
Retroactive Interference: Later learning disrupts earlier learning
STM/LTM:
Stort-term memory:
Long-term memory:
Sensory memory:
Literal record of perceptual
experience.
But lost quickly (transient)
Large capacity.
Visual and auditory:
Iconic memory: Holds visual input for 250 msec.
Representation is pre-categorical (literal record of percept)
Echoic memory: Holds auditory input for 2-3 seconds.
Short capacity- 20 seconds
Limited storage: 4-7 items.
Primacy and recency effect:
Primacy: Primacy gradient – First few items can be rehearsed a lot and so more likely to move into long-term memory.
Recency gradient: Last few items of the list are still in STM when you start to write the list down.
Unlimited capacity.
Unlimited storage.
Explict/ Implicit memory:
Explicit memory: when retrieval of a memory is deliberate/requires conscious recollection. Also called declarative memory
Implicit memory:Implicitmemory–whenbehaviour/performance indicates that memories are being retrieved in the absence of a deliberate, conscious, attempt to retrieve them, e.g.,
Episodic vs. Semantic memory:
Episodic memories (memories for
experiences/events)
Semantic memories (general knowledge about the world).
Memory disorders:
Amnesisa:
Causes:
Organic: Acute damage, Degenerative to the brain disease.
Psychogenic (purely psychological)
Retrograde: a loss of memory-access to events that occurred or information that was learned in the past.
Anterograde amnesia: A loss of the ability to create new memories after the event that caused amnesia, leading to a partial or complete inability to recall the recent past, while long-term memories from before the event remain intact.
Causes of organic amnesia
Organic amnesia:Biological (damage to the brain)
What parts of the brain are important for memory:
Amygdala: (Emotional content of episodic memories).
Basal ganglia: (Important for learning motor skills).
Cerebellum: (Memory for automatised skills).
Frontal lobes [working memory (central executive); source monitoring; prospective memory].
Occipital lobes: (Visual Perceptual representation memory).
Hippocampus: Main section for creating and storing memories in the brain.
Case of HM:
HM: operated on in the 1950s to bilaterally remove the medial temporal lobes (about 2/3 of the hippocampus). Following surgery, his epilepsy was greatly improved. Personality was unchanged. IQ went up. However.... Unable to encode/retrieve new episodic memories. HM had developed severe anterograde amnesia.
Korsakoff’s syndrome: Caused by thiamine deficiency (usually due to alcoholism) Damage to diencephalon in particular. Damage to diencephalon in particular. Patients often appear to be drunk – uncoordinated, confused.
Temporal gradient of retrograde amnesia: Recall for events in the time immediately leading up to its onset very poor, but earlier memories relatively intact.
Encoding/retrieval of new episodic long-term memories impaired
(i.e., anterograde amnesia).
Viral encephalitis: Caused by herpes virus crossing blood-brain barrier. Sudden onset of acute fever, headache, nausea. Usually extensive bilateral temporal lobe damage.
Dementia: General degeneration of the brain; atrophy due to growth of neural plaques and tangles. Progressive neural disease.
Pattern of impairments involves 'information-processing' deficits superimposed upon an amnesic syndrome (Morris & Kopelman, 1986)
Amnesia and dissociations between different types of memory:
Amnesia provides evidence for a dissociation between STM and LTM.
Amnesics have poor episodic memory, and often (though not always) have poor semantic memory, but: Still have motor memory (skills), such as making a telephone. call or making coffee (procedural memory).
Show normal sized priming effects.
Still have motor memory (skills), such as making a telephone.
Can learn new procedural skills such as tracking a moving target in the pursuit rotor task.
Several theories have been considered, including:
Accelerated forgetting.
Faulty retrieval.
Faulty encoding.
Faulty encoding/storage of contextual information.
Temporal gradient in retrograde amnesia: Temporal gradient may be due to slow-scale consolidation processes => earlier memories have had longer to be consolidated.
Consolidation theory of retrograde amnesia:
Episodic memories are initially encoded and stored in hippocampus, and retrieving these memories requires reactivating the hippocampus. Over time, the hippocampal memory trace becomes less important, and the cortex can retrieve the memory without the hippocampus.
Consequently, older memories are spared in retrograde amnesia.
Psychogenic amnesia:
Dissociative type:
Fugue states:
Sufferer typically found wandering, often a long way from home.
Triggers include severe stress, depressed mood ,history of transient organic amnesia (Kihlstrom & Schacter, 1995).
Lasts a few hours or days (can be longer).
After recovery, memories from the fugue state are lost.
Very rare.
Refers to loss of memory for specific events due to trauma (no anterograde amnesia).
25-45% of homicide suspects claim amnesia for the crime!
No cases reported before 1800: “dissociative amnesia is not a natural neuropsychological phenomenon, but instead a culture-bound syndrome, dating from the 19th century” (Pope et al., 2007).
These types of psychogenic amnesia are extraordinarily rare (though much more common in fiction!).
Very difficult to rule out malingering as a possible cause.