Zero (phantom, unvoiced) word support. #224

Open
@linas

Description

Zero/phantom words: Expressions such as "Looks good" have an implicit "it" (also called a zero-it or phantom-it) in them; that is, the sentence should really parse as "(it) looks good". The dictionary could be simplified by admitting such phantom words explicitly, rather than modifying the grammar rules to allow such constructions.

Other examples, with the phantom word in parentheses, include:

  • I ate all (of) the cookies.
  • I taught him (how) to swim.
  • I told him (that) it was gone.
  • (It) looks good.
  • (You) go home!
  • You know you can (do something).
  • I wish I had (done something).
  • I will (do something), if you do.
  • How often (does it happen)?
  • How big (is it)?
  • Room w/sea view (is) available. -- zero copula
  • (some) thieves rob(-bed a) bank! -- newspaper headline

One possible solution to the unvoiced-word problem might be to allow the LG rules to insert alternatives during the early culling stages. This avoids the need to pre-insert all possible alternatives during tokenization...
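
To make the idea concrete, here is a minimal Python sketch of rule-driven insertion. It is not the Link Grammar API: the `Rule` type, the two rules `zero_it` and `zero_of`, and `phantom_alternatives` are all hypothetical names invented for illustration, and the trigger words are assumptions chosen only to match two of the examples above. The point is that alternatives are generated lazily, and only at gaps where some rule fires, instead of every phantom word being pre-inserted at every position during tokenization.

```python
from typing import Callable, Iterator, List, Optional, Sequence

# Hypothetical rule type: given the lowercased left and right neighbours of a
# gap (None at a sentence boundary), return a phantom word or None.
Rule = Callable[[Optional[str], Optional[str]], Optional[str]]


def zero_it(left: Optional[str], right: Optional[str]) -> Optional[str]:
    # "(It) looks good." The verb list is an assumption for illustration.
    if left is None and right in {"looks", "seems", "sounds"}:
        return "it"
    return None


def zero_of(left: Optional[str], right: Optional[str]) -> Optional[str]:
    # "I ate all (of) the cookies."
    if left == "all" and right == "the":
        return "of"
    return None


RULES: List[Rule] = [zero_it, zero_of]


def phantom_alternatives(
    tokens: Sequence[str], rules: Sequence[Rule] = RULES
) -> Iterator[List[str]]:
    """Yield the original tokens, then one alternative per rule that fires.

    Each alternative inserts a single phantom word, written in parentheses,
    at the gap where the rule matched.
    """
    tokens = list(tokens)
    yield list(tokens)
    for gap in range(len(tokens) + 1):
        left = tokens[gap - 1].lower() if gap > 0 else None
        right = tokens[gap].lower() if gap < len(tokens) else None
        for rule in rules:
            word = rule(left, right)
            if word is not None:
                yield tokens[:gap] + [f"({word})"] + tokens[gap:]


if __name__ == "__main__":
    for alt in phantom_alternatives(["Looks", "good"]):
        print(" ".join(alt))
```

A real implementation would presumably consult the dictionary rather than hard-coded word lists, and would hand each alternative to the parser instead of printing it; this sketch only shows the shape of on-demand insertion.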