A Computational Model for Phonetic Interpretation

Motivation

  1. ``We listen to the beginnings and ends of things and just guess the rest,'' said Bill Idsardi in the pscholinguistics lab meeting.
  2. ``Double-Accent Phenomenon'' Mark Greenstreet explained why one of the grad students could not understand a faculty candidate.

Idea

Given a stream of sounds as input, the beginnings and ends of each word and also the beginning and end of the entire stream are more clear than the rest of the stream. So in order to interpret the sounds, we first map the clearer sounds to what we know already and then guess the stuff in between.

Steps Involved

  1. Define the native inventory. A monolingual has one native inventory and a multilingual has one native inventory with the non-active inventory sounds suppressed.
  2. If the listener is not a monolingual, then also define a working inventory. A working inventory is the inventory that holds the sounds of a learning language. Some sounds in the working inventory may be the same as some items in the native inventory.
  3. INPUT = stream of phonetically marked sounds. Some sounds are more clear than others, and some are not clear at all.
  4. Define the mapping and guessing functions. These two functions may interact so it is not so clear to me now what operations belong to which function. The necessary operations for both of these are:
    1. Which sounds do we try to map first? Left->right? One pass only? In parallel?
    2. Search strategy: Do some sounds have a shorter path to the inventory? Which inventory is "closer" to get to? Does there also exist a temporary inventory for each conversation?
    3. When multiple candidates exist, how do we choose the right one?
    4. Conformity: Obviously, at least phonology and semantics will interact here somehow. What exactly do they do?
    5. If a mapping or a guess returns a "wrong" result, do we reanalyze? If so, how many times do we try this? When do we give up?
    6. How are steps (a) to (e) ordered/combined with respect to each other?
  5. OUTPUT = an interpreted sound sequence OR fail

Scenarios

Case 1: Native speaker (A) listens to Native speaker (B).

Case 2: Native speaker (A) listens to Foreign speaker (B).

Case 3: Foreign speaker (A) listens to Native speaker (B).

Case 4: Foreign speaker (A) listens to Foreign speaker (B), where A is different from B.