test1234567 added Id_overview_tex_v_1__.tex  about 9 years ago

Commit id: 479cfe491a3c47799beca20b659ee71977acaf96

deletions | additions      

         

% $Id: overview.tex,v 1.39 1994/12/13 04:49:36 fergus Exp $  %\input{stuff}  %\makeindex  %\begin{document}  \section{An Overview of Basic Clause Structure}  \label{list:overview}  This section presents of an overview of basic structure, but it does not  attempt to summarize or justify the entire policy. For an overview of the  new bracketing style, with some notes about its usefulness, see  \cite{marcusetal94} (included on the {\sc cdrom} release as ``arpa94'').  For an overview of the Treebank Project in general, see \cite{marcusetal93}  (``cl93''). Note also that, for the most part, this manual focuses on {\em  problematic}\/ constructions rather than {\em common}\/ ones, for mostly  historical reasons.  The basic structure of an S \index{S} in the Treebank grammar was formerly  (Preliminary Release, Version 0.5, 1992):  \begin{verbatim}  (S (NP Casey)  (VP throws  (NP the ball)))  \end{verbatim}  More complicated structures were annotated with an AUX \index{AUX} node.  \begin{verbatim}  (S (NP Casey)  (AUX will)  (VP throw  (NP the ball)))  \end{verbatim}  \begin{verbatim}  (S (NP Casey)  (AUX should)  (VP have  (VP thrown  (NP the ball))))  \end{verbatim}  The basic approach to simple sentences has not changed. However, we no  longer have a specially bracketed and labeled AUX; the node that was  formerly AUX \index{AUX} now corresponds to the highest level of the VP.  \begin{verbatim}  (S (NP-SBJ Casey)  (VP will  (VP throw  (NP the ball))))  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ Casey)  (VP should  (VP have  (VP thrown  (NP the ball)))))  \end{verbatim}  \subsection{Basic elements of S} \index{S!basic elements}  \subsubsection{Predicate} \index{predicate}  The predicate is either the lowest (right-most branching) VP or (after  copular verbs and in ``small clauses'') a constituent tagged  \index{PRD@-PRD} -PRD. Moved predicates leave a coindexed trace *T* in VP.  \subsubsection{Arguments of the predicate} \index{predicate!arguments of}  \begin{itemize}  \item{External:}  The {\bf surface subject} \index{subject!surface} is tagged -SBJ (subject).  \index{SBJ@-SBJ}  \item{Internal:}  \begin{enumerate}  \item {\bf Direct object NP}\/: \index{objects!direct} occurs after the  verb, has no function tag, and is not followed by another NP.  \item {\bf Indirect object NPs} \index{objects!indirect} are of the  following types:  (i) NPs that occur between the verb and its direct object and have no  function tags, as in dative shift constructions (e.g., {\it gave {\bf Mary}\/  the book}\/)  \index{dative shift}  (ii) dative PPs (e.g., {\it to Mary}\/), \index{dative PPs} tagged -DTV.  Only verbs that can undergo dative shift are considered to have  \index{objects!dative} dative objects.  \end{enumerate}  \end{itemize}  \subsubsection{Level of attachment} \index{attachment!levels of}  \begin{itemize}  \item S-level: \index{S}  The following are attached at S-level: subject NP, highest VP, fronted  constituents, initial and final punctuation, and most modifiers that  precede the verb phrase. When there is no VP (as in ``small clauses''),  the predicate is labeled -PRD \index{PRD@-PRD} , and it and any following  adjuncts are attached at S-level.  \item VP-level: \index{VP}  \begin{enumerate}  \item Almost all modifiers \index{modifiers!postverbal} that follow the  verb are attached under the lowest appropriate VP. When there is  conjunction and the modifier applies to both VPs, the modifier is attached  at conjunction level.  \item An exception is made for modifiers \index{modifiers!as appositives}  \index{apposition!to predicate}  that are interpreted as appositives to the event or the predicate itself.  Such modifiers are adjoined to VP. Some of them may also have a -ADV tag.  \begin{verbatim}  (S (NP-SBJ Investors)  (VP might  (VP (VP appear  (ADJP-PRD unenthusiastic  (PP about  (NP the new issue))))  (SBAR (WHNP-1 which)  (S (NP-SBJ *T*-1)  (VP might  (VP force  (S (NP-SBJ the government)  (VP to  (VP raise  (NP the coupon)  (PP-CLR to  (NP (QP more than 7)  %))))))))))))  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ they)  (VP would  (VP (VP negotiate  (NP rates)  (ADVP-MNR individually)  (PP-CLR with  (NP advertisers)))  ,  (NP-ADV (NP a practice)  (ADJP common  (PP-LOC in  (NP broadcasting)))))))  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ factory inventories)  (VP (VP fell  (NP-EXT 0.1 %)  (PP-TMP in  (NP September)))  ,  (NP (NP the first decline)  (PP-TMP since  (NP February 1987)))))  \end{verbatim}  \end{enumerate}  \end{itemize}  \subsubsection{Complementation within syntactic categories}  \index{complementation}  The complement is attached inside the VP, NP, ADJP, or PP.  \begin{itemize}  \item{Verbs:}  The term ``complement'' as it is used here refers to: \index{complements}  \begin{enumerate}  \item internal arguments \index{arguments!internal} such as NP objects, S  and SBAR with no adverbial dash tags (including some {\it if}\/-clauses, as  in {\it I wonder if the Cubs are winning}\/), and quoted constituents  (including SINV and FRAG)  \item the passive logical-subject {\it by}\/-phrase \index{by-phrases@{\it  by}\/-phrases} \index{subject!logical}  \item VP \index{VP}  \item constituents tagged -BNF, -CLR, -DTV, -PRD, and -PUT  \end{enumerate}  \begin{verbatim}  (S (NP-SBJ-1 the guide)  (VP was  (VP given  (NP *-1)  (PP-DTV to  (NP Arthur))  (PP by  (NP-LGS Ford)))))  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ-1 Casey)  (VP ought  (S (NP-SBJ *-1)  (VP to  (VP have  (VP thrown  (NP the ball)))))))  \end{verbatim}  \item{Nouns:} \index{nouns}  Since it is difficult to consistently annotate an argument/adjunct  distinction, all PP modifiers \index{modifiers!PP} of nouns are  Chomsky-adjoined to the NP:  \begin{verbatim}  (NP (NP a teacher)  (PP of  (NP chemistry)))  \end{verbatim}  However, clausal complements \index{complements!clausal} are recognized:  \begin{verbatim}  (NP the belief  (SBAR that  (S the world is flat)))  \end{verbatim}  \item{Adjectives:} \index{adjectives}  Except in comparatives, any modifier \index{modifiers!postadjectival}  following an adjective is bracketed as a complement.  \begin{verbatim}  (ADJP eager/likely/ready  (S to believe anything))  \end{verbatim}  \begin{verbatim}  (ADJP full  (PP of  (NP life)))  \end{verbatim}  \item{Prepositions:} \index{prepositional phrases}  The NP or S complement \index{complements!of prepositions} of a preposition  is placed inside the PP.  \end{itemize}  \subsubsection{Modification}  \begin{itemize}  \item{Premodifiers:} \index{premodifiers}  Premodifiers generally are placed inside the phrase they are associated  with:  \begin{verbatim}  (NP the red ball)  \end{verbatim}  \begin{verbatim}  (ADJP extremely delicious)  \end{verbatim}  \begin{verbatim}  (ADVP (NP one year) ago)  \end{verbatim}  VP premodifiers, \index{modifiers!preverbal} however, are more often  attached at S-level:  \begin{verbatim}  (S (NP-SBJ Sandy)  (ADVP-TMP often)  (VP throws  (NP curves)))  \end{verbatim}  but they may also be attached inside the VP (see \RefShared\ for more details):  \begin{verbatim}  (S (NP-SBJ Sandy)  (VP (ADVP-MNR sneakily)  threw  (NP a curve)))  \end{verbatim}  \item{Postmodifiers:} \index{postmodifiers}  Postmodifiers of NPs and comparative ADJPs are adjoined to the NP or ADJP:  \begin{verbatim}  (NP (NP a book)  (PP about  (NP toads)))  \end{verbatim}  \begin{verbatim}  (ADJP (ADJP as tall)  (PP as  (NP him)))  \end{verbatim}  Postmodifiers of VP \index{modifiers!postverbal} are attached under VP,  with adverbial function tag(s) where appropriate:  \begin{verbatim}  (VP reading  (PP-CLR about  (NP toads))  (PP-LOC on  (NP the Internet)))  \end{verbatim}  When it is not clear whether a modifier within the VP should be attached at  VP-level or to an object NP, the {\sc default} is to attach at VP-level (see  \RefPseudo).  % (was III.A)  \end{itemize}  \subsection{Clause types} \index{clause types}  We distinguish among a number of basic clause types: S, SINV, SBAR, RRC,  SBARQ, SQ, S-CLF, {\it it}\/-extraposition, and FRAG.  \subsubsection{S} \index{S}  \begin{itemize}  \item{Simple declarative sentences:}  \begin{verbatim}  (S (NP-SBJ Casey)  (VP threw  (NP the ball)))  \end{verbatim}  \item{Passives:} \index{passives}  The surface subject is tagged -SBJ \index{subject!surface} , the passive  trace \index{passive traces} is indicated with (NP *) and coindexed to the  surface subject, the {\it by}\/-phrase \index{by-phrases@{\it  by}\/-phrases} is a child of VP, and the logical subject  \index{subject!logical} is tagged -LGS  \index{LGS@-LGS} . (Note that the -LGS tag goes on the NP and not on the PP of  the {\it by}\/-phrase.)  \begin{verbatim}  (S (NP-SBJ-1 The ball)  (VP was  (VP thrown  (NP *-1)  (PP by  (NP-LGS Casey)))))  \end{verbatim}  \item{Imperatives:} \index{imperatives}  Imperatives are labeled S and given a null subject (NP-SBJ *).  \begin{verbatim}  (S (NP-SBJ *)  (VP Throw  (NP the ball))  !)  \end{verbatim}  If the name of the addressee appears with the imperative (at either the  beginning or end), it is tagged -VOC (vocative) \index{VOC@-VOC} . The  vocative is NOT coindexed to the null surface subject.  \begin{verbatim}  (S (NP-VOC American imperialists)  ,  (NP-SBJ *)  (VP go  (ADVP-DIR home))  !)  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ *)  (VP Close  (NP the door)  ,  (NP-VOC John))  .)  \end{verbatim}  \item{Questions with declarative word order:} \index{questions}  Sentences that end with a question mark but have non-inverted word order  are labeled S:  \begin{verbatim}  (S (NP-SBJ This)  (VP is  (NP-PRD Japan))  ?)  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ You)  (VP did  (NP what))  ?)  \end{verbatim}  However, questions that are missing both subject and auxiliary are labeled  SQ \index{SQ} .  \begin{verbatim}  (SQ (NP-SBJ *)  (VP See  (NP that cute dog))  ?)  \end{verbatim}  \item{Infinitives:} \index{infinitives}  Infinitives are labeled S and take (NP-SBJ *) as the null subject,  where {\it to}\/ represents the highest level of the VP. (See \RefInf\  for more on the annotation of infinitivals.)  \begin{enumerate}  \item{Complement clauses.} \index{complements!clausal}  When the infinitive is a VP complement, the null subject of the infinitive  is coindexed as usual to its logical subject (usually the subject of the  matrix clause, but sometimes the object of the verb or not coindexed at  all).  \begin{verbatim}  (S (NP-SBJ-1 Casey)  (VP wants  (S (NP-SBJ *-1)  (VP to  (VP throw  (NP the ball))))))  \end{verbatim}  \item{Purpose clauses.} \index{purpose clauses}  Purpose clauses are attached at S and labeled -PRP (purpose/reason)  \index{PRP@-PRP} . The subject is coindexed to the surface subject of the  matrix clause when there is a coindexed interpretation.  \begin{verbatim}  (S (NP-SBJ-1 Sue)  (VP arrived  (ADVP-TMP early)  (S-PRP (NP-SBJ *-1)  (VP to  (VP get  (NP a good seat))))))  \end{verbatim}  \item{Infinitival relatives.} \index{infinitival relatives}  In the case of infinitival relatives, the relative is adjoined to NP and  dominated by SBAR with a zero {\it wh}\/-complementizer labeled according  to the role played by the gapped constituent. A *T* in the position of the  gap is coindexed to the {\it wh}\/-complementizer. The (NP-SBJ *) is not  indexed.  %A- never subject IRC??  \begin{verbatim}  (NP (NP a movie)  (SBAR (WHNP-1 0)  (S (NP-SBJ *)  (VP to  (VP see  (NP *T*-1))))))  \end{verbatim}  \end{enumerate}  \item{Participial and gerund clauses:} \index{participles} \index{gerunds}  Participial clauses have full clause structure, with either a lexical or  null (NP-SBJ *) subject. (See \RefGP\ for more on the annotation of  participial and gerund clauses.)  ``Floating'' participles are tagged -ADV.  \index{participles!floating@``floating''}   \begin{verbatim}  (S (S-ADV (NP-SBJ The crowd)  (VP cheering  (ADVP-MNR madly)))  ,  (NP-SBJ Willie)  (VP caught  (NP the ball)))  \end{verbatim}  When appropriate, the null subject is coindexed to an NP in the matrix  clause (generally the logical/structural subject).  \begin{verbatim}  (S (S-ADV (NP-SBJ *-1)  (VP Running  (PP-DIR toward  (NP Casey))))  ,  (NP-SBJ-1 Willie)  (VP caught  (NP the ball)))  \end{verbatim}  Gerunds \index{gerunds} that act as the surface subject or as the object of  a preposition are tagged -NOM \index{NOM@-NOM} .  \begin{verbatim}  (S (S-NOM-SBJ (NP-SBJ *)  (VP Baking  (NP pies)))  (VP is  (ADJP-PRD fun)))  \end{verbatim}  \begin{verbatim}  (S (PP With  (S-NOM (NP-SBJ interest rates)  (VP rising)))  ,  (NP-SBJ the market)  (VP is  (VP moving  (ADVP-MNR slowly))))  \end{verbatim}  Verb complement gerunds \index{gerunds!as complement of verb} , including  those in ``serial'' verb constructions, are all bracketed as a simple S,  with coindexed subject when appropriate.  \begin{verbatim}  (S (NP-SBJ I)  (VP do not  (VP mind  (S (NP-SBJ you(r))  (VP leaving  (ADVP-TMP early))))))  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ-1 The audience)  (VP keeps  (S (NP-SBJ *-1)  (VP leaving  (ADVP-TMP early)))))  \end{verbatim}  SBAR complement gerunds \index{gerunds!as complement of SBAR} are also  bracketed as a simple S.  \begin{verbatim}  ( (S (NP-SBJ-1 He)  (VP ate  (NP television)  (SBAR-TMP while  (S (NP-SBJ *-1)  (VP watching  (NP dinner)))))  .))  \end{verbatim}  \end{itemize}  \subsubsection{SINV} \index{SINV}  The SINV label is used for subject-auxiliary inversion \index{inversion} in  the case of negative inversion, conditional inversion, locative inversion,  and some topicalizations. (SINV is not used with questions. See  section \ref{list:overview:clausetype:sq:y/n:inversion} and  section \ref{list:overview:clausetypes:sbarq:wh-questions:inversion} for the  treatment of subject-auxiliary inversion in the case of yes/no questions  and {\it wh}\/-questions, respectively.) Inverted auxiliaries are  unlabeled.  \begin{verbatim}  (SINV (ADVP-TMP Never)  had  (NP-SBJ I)  (VP seen  (NP such a place)))  \end{verbatim}  When the inversion results in a conditional clause \index{conditionals}  (i.e., when it is equivalent to (SBAR-ADV if... ), the SINV is enclosed in  SBAR-ADV).  \index{SBAR-ADV} \index{if-clauses@{\it if}\/-clauses}  \begin{verbatim}  (S (SBAR-ADV (SINV had  (NP-SBJ Casey)  (VP thrown  (NP the ball)  (ADVP-MNR harder))))  ,  (NP-SBJ it)  (VP would  (VP have  (VP reached  (NP home plate)  (PP-TMP in  (NP time))))))  \end{verbatim}  When subject-aux inversion \index{inversion} is triggered by the predicate  (or a piece thereof) moving out of the VP to a position preceding the  subject, the moved predicate phrase is tagged -TPC (topicalized)  \index{TPC@-TPC} and leaves a coindexed trace in the VP. Note that in this  case, the auxiliaries that precede the subject are labeled VP (whereas in  other cases of inversion they are left unlabeled), so that there's a place  to properly attach the VP trace.  \begin{verbatim}  (SINV (VP-TPC-1 Marching  (PP-DIR past  (NP the reviewing stand)))  (VP were  (VP *T*-1))  (NP-SBJ 500 musicians))  \end{verbatim}  \begin{verbatim}  (SINV (ADJP-PRD-TPC-2 (ADVP Even more)  unusual)  (VP is  (ADJP-PRD *T*-2))  (NP-SBJ (NP the mating behavior)  (PP of  (NP the praying mantis))))  \end{verbatim}  \begin{verbatim}  (SINV (ADVP Also)  (ADJP-PRD-TPC-11 present)  (VP will  (VP be  (ADJP-PRD *T*-11)))  (NP-SBJ (NP (NP the bride 's)  children)  ,  (NP Joan and Kirkland)))  \end{verbatim}  \begin{verbatim}  (SINV (ADVP-DIR-TPC-3 Out)  (VP might  (VP have  (VP popped  (ADVP-DIR *T*-3))))  (NP-SBJ a jack-in-the-box))  \end{verbatim}  \begin{verbatim}  (SINV (S-TPC-4 (NP-SBJ We)  (VP will  (VP win)))  ,  (VP said  (SBAR 0  (S *T*-4))  (NP-SBJ Casey)))  \end{verbatim}  \subsubsection{SBAR} \index{SBAR}  SBAR is used for relative clauses and subordinate clauses, including  indirect questions. \index{questions!indirect}  \begin{verbatim}  (S (NP-SBJ (NP The person)  (SBAR (WHNP-1 who)  (S (NP-SBJ *T*-1)  (VP threw  (NP the ball)))))  (VP is  (ADJP-PRD very athletic)))  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ Willie)  (VP knew  (SBAR that  (S (NP-SBJ Casey)  (VP threw  (NP the ball))))))  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ Willie)  (VP asked  (SBAR (WHNP-1 who)  (S (NP-SBJ *T*-1)  (VP threw  (NP the ball))))))  \end{verbatim}  The {\it wh}\/-prefixed labels, WHNP, \index{WHNP} WHADVP, \index{WHADVP}  WHADJP, \index{WHADJP} WHPP, \index{WHPP} are used only when there is {\it  wh}\/-movement, \index{wh-movement@{\it wh}\/-movement} and they always  leave a trace *T*. (See \RefWHph\ \index{wh-clauses@{\it  wh}\/-clauses} for more information.)  \begin{verbatim}  (S (NP-SBJ-1 The committee)  (VP continued  (NP its meeting)  (SBAR-TMP while  (S (NP-SBJ *-1)  (VP eating  (NP lunch))))))  \end{verbatim}  \subsubsection{RRC (reduced relative clause)} \index{RRC} \index{relative  clauses}  Reduced relative clauses are adjoined to the NP they modify.  %How are we going to refer readers of the overview who want more detailed  %info to the appropriate section of the manual? The section on reduced  %relatives, for example, is deep in Gerunds-Participles and might require  %some searching through the table of contents to find. Shall we give  %references for every section heading in here?  %  %A- the reference below seems OK to me.  \begin{verbatim}  (NP (NP An orangutan)  (VP foaming  (PP-CLR at  (NP the mouth))))  \end{verbatim}  All sentential modifiers in reduced relatives are attached at VP level,  even when they precede the verb itself.  \begin{verbatim}  (NP (NP a car)  (VP not built  (NP *)  (PP by  (NP-LGS Mazda))  (PP-TMP for  (NP the last five years))))  \end{verbatim}  The RRC label \index{RRC} is used only in cases where there is no VP and an  extra level is needed for proper attachment of sentential modifiers (see  \RefGP).  %was VII.A.2  \begin{verbatim}  (NP (NP 110 titles)  (RRC not  (ADVP-TMP presently)  (PP-LOC in  (NP the collection))))  \end{verbatim}  In the case of passives, \index{relative clauses!passive} a passive trace  (NP *) is inserted under the VP, but it is not indexed, since the null  subject to which the passive trace \index{passive traces} would be  coindexed is not present in the annotation. The logical subject is tagged  -LGS.  \begin{verbatim}  (S (NP-SBJ I)  (VP bought  (NP (NP a car)  (VP built  (NP *)  (PP by  (NP-LGS Mazda))))))  \end{verbatim}  Compare with the unreduced:  \begin{verbatim}  (S (NP-SBJ I)  (VP bought  (NP (NP a car)  (SBAR (WHNP-5 which)  (S (NP-SBJ-6 *T*-5)  (VP was  (VP built  (NP *-6)  (PP by  (NP-LGS Mazda)))))))))  \end{verbatim}  \subsubsection{SBARQ} \index{SBARQ}  \label{list:overview:clausetypes:sbarq:wh-questions:inversion}  The SBARQ label marks {\it wh}\/-questions \index{wh-questions@{\it  wh}\/-questions} (i.e., those that contain a gap and therefore require a  trace). A further level of structure, SQ, \index{SQ} contains the inverted  auxiliary (if there is one) and the rest of the sentence. The inverted  auxiliary in {\it wh}\/-questions is not labeled. \index{inverted auxiliary}  \begin{verbatim}  (SBARQ (WHNP-1 Who)  (SQ (NP-SBJ *T*-1)  (VP threw  (NP the ball)))  ?)  \end{verbatim}  \begin{verbatim}  (SBARQ (WHNP-2 What)  (SQ did  (NP-SBJ Casey)  (VP throw  (NP *T*-2)))  ?)  \end{verbatim}  \begin{verbatim}  (SBARQ (WHNP-3 Who)  (SQ (NP-SBJ *T*-3)  (VP will  (VP throw  (NP the ball))))  ?)  \end{verbatim}  If the main verb is inverted, \index{inversion} most arguments and adjuncts  go at SQ level:  \begin{verbatim}  (SBARQ (WHADVP-1 Why)  (SQ (PP-LOC in  (NP these movies))  is  (NP-SBJ the unwed pregnant woman)  (ADVP-TMP always)  (PP-PRD from  (NP Ohio))  (ADVP-PRP *T*-1))  ?)  \end{verbatim}  \subsubsection{SQ} (See also section \ref{list:overview:s-clf:sq-clf}.)  \index{SQ}  \begin{itemize}  \item{inside SBARQ:}  As described above, inside {\it wh}\/-questions, SQ holds the subject,  inverted auxiliary (if any), main verb phrase, and some adjuncts.  \item{yes/no questions:}  \label{list:overview:clausetype:sq:y/n:inversion}  SQ is used for yes/no questions \index{questions!yes/no} (i.e., those with  inversion but no {\it wh}\/-movement).  \begin{verbatim}  (SQ Did  (NP-SBJ Casey)  (VP throw  (NP the ball))  ?)  \end{verbatim}  \item{subject-less yes/no questions:} \index{questions!yes/no}  In questions where the auxiliary and subject do not appear, the auxiliary  is unlabeled and a null subject (NP-SBJ *) is used.  \begin{verbatim}  (SQ (NP-SBJ *)  (VP See  (NP that cute dog))  ?)  \end{verbatim}  Note that questions with overt subjects and auxiliaries that show  declarative word order are simply labeled S.  \item{Tag questions:} \index{questions!tag}  Tag questions are treated as an adjunction of SQ to S. The resulting  structure is labeled SQ, since the whole thing is interrogative in nature.  The lower SQ is annotated to show predicate deletion; that is, an  appropriate null *?* is inserted.  \begin{verbatim}  (SQ (S But  (NP-SBJ you)  (VP knew  (NP that)))  ,  (SQ did n't  (NP-SBJ you)  (VP *?*))  ?)  \end{verbatim}  \begin{verbatim}  (SQ (S (NP-SBJ That)  (VP 's  (NP-PRD the problem)))  ,  (SQ is  n't  (NP-SBJ it)  (NP-PRD *?*))  ?)  \end{verbatim}  \end{itemize}  \subsubsection{S-CLF} ({\it it}\/-cleft or ``true'' cleft) \index{CLF@-CLF}  \index{clefts} \index{it-clefts@{\it it}\/-clefts}  Declarative {\it it}\/-clefts are labeled S-CLF, expletive {\it it}\/ is  tagged as the surface subject (-SBJ), the SBAR is attached at VP-level, and  a trace is coindexed to the {\it wh}\/-complementizer of the clefted  portion. (See \RefIt\ for more information.)  \begin{verbatim}  (S-CLF (NP-SBJ It)  (VP was  (NP Casey)  (SBAR (WHNP-1 who)  (S (NP-SBJ *T*-1)  (VP threw  (NP the ball))))))  \end{verbatim}  Traces for all adverbials, including purpose clauses, are labeled (ADVP  *T*) and coindexed to the WHADVP complementizer. If a preposition is  pied-piped, WHPP and (PP *T*) are used.  \begin{verbatim}  (S-CLF (NP-SBJ It)  (VP is  (ADVP-TMP-PRD then)  (SBAR (WHADVP-1 that)  (S (NP-SBJ-2 young queens)  (VP begin  (S (NP-SBJ *-2)  (VP to  (VP appear)))  (ADVP-TMP *T*-1))))))  \end{verbatim}  Interrogative {\it it}\/-clefts are labeled SQ-CLF:  \index{clefts!interrogative}  \label{list:overview:s-clf:sq-clf}  \begin{verbatim}  (SQ-CLF Was  (NP-SBJ it)  (NP-PRD John)  (SBAR (WHNP-1 who)  (S (NP-SBJ *T*-1)  (VP came  (PP-DIR to  (NP the party))  (PP-CLR in  (NP a dress)))))  ?)  \end{verbatim}  Note: {\it wh}\/-clefts do not receive special treatment in the corpus.  They contain a free/headless relative, followed by a form of the verb {\it  be}\/, followed by a predicate (e.g., {\it What sustained the worst  injuries / was / the car}\/). (See \RefNotation\ for a bit more on free  relatives.)  \index{wh-clefts@{\it wh}\/-clefts} \index{clefts}  \subsubsection{{\it it}\/-extraposition} \index{it-extraposition@{\it  it}\/-extraposition}  Clauses that are extraposed from subject position are labeled S or SBAR.  The extraposed clause is attached at VP level and adjoined to the ``it''  with *EXP*-attach. The NP containing {\it it}\/ and *EXP* is tagged -SBJ.  (See \RefExp\ for more information.)  \begin{verbatim}  (S (NP-SBJ (NP It)  (S *EXP*-1))  (VP is  (NP-PRD a pleasure)  (S-1 (NP-SBJ *)  (VP to  (VP teach  (NP her))))))  \end{verbatim}  \subsubsection{FRAG} (fragment) \index{FRAG}  FRAG marks those portions of text that appear to be clauses, but lack too  many essential elements for the exact structure to be easily determined  (e.g., answers to questions). Predicate argument structure therefore  cannot be extracted from FRAGs. Some examples of what we have called FRAG:  \begin{verbatim}  (SBARQ (WHNP-9 Who)  (S (NP-SBJ *T*-9)  (VP threw  (NP the ball)))  ?)  (FRAG (NP Casey)  ,  (NP-TMP yesterday))  \end{verbatim}  \begin{verbatim}  (S (SBAR-ADV if  (FRAG not  (NP-TMP today)))  ,  Casey will throw the ball tomorrow)  \end{verbatim}  \begin{verbatim}  (SBAR-ADV Though  (FRAG (ADJP limited)))  \end{verbatim}  \begin{verbatim}  (SBAR-ADV if  (FRAG (ADJP possible)))  \end{verbatim}  \begin{verbatim}  (SBAR-ADV (WHNP Whatever)  (FRAG (NP the long-term economic effect)))  \end{verbatim}  \begin{verbatim}  (FRAG (PP-LOC Among  (NP (NP the Guinness disk 's)  wonders))  :  (NP (NP the world 's)  loudest recorded belch)  .)  \end{verbatim}  \begin{verbatim}  ( (FRAG (NP (NP Two guys)  (PP from  (NP (NP Gary)  ,  (NP Ind.))))  ?  ''))  \end{verbatim}  \begin{verbatim}  ( (FRAG Not  (ADJP so)  .))  \end{verbatim}  \begin{verbatim}  ( (FRAG (VP Guaranteed  (NP *)  (PP by  (NP-LGS India)))  .))  \end{verbatim}  \subsection{Clause combinations}  \subsubsection{Coordination} \index{coordination}  See \RefShared\ and \RefCoord\ for details concerning coordination  structures.  \begin{itemize}  \item Phrase coordination  Coordination of phrases is represented in the annotation at the lowest  level possible. Single words are assumed to coordinate at word level  rather than projecting their own phrases, and only the highest level is  represented.  \begin{verbatim}  (S (NP-SBJ Girls and boys)  (VP throw and catch  (NP balls)))  \end{verbatim}  However, the addition of modifiers generally forces a higher level of  coordination, which is shown with Chomsky-adjunction structure.  \begin{verbatim}  (S (NP-SBJ (NP These girls)  and  (NP those boys))  (VP (VP throw  (ADVP-MNR well))  and  (VP catch  (ADVP-MNR badly))))  \end{verbatim}  \item Clause coordination  When like clauses are coordinated, the level of coordination has the same  label as the coordinated clauses.  \begin{verbatim}  (S (S (NP-SBJ Casey)  (VP threw  (NP the ball)))  and  (S (NP-SBJ Willie)  (VP caught  (NP it))))  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ Jackie)  (VP knew  (SBAR (SBAR (WHNP-1 who)  (S (NP-SBJ *T*-1)  (VP threw  (NP the ball))))  and  (SBAR (WHNP-2 who)  (S (NP-SBJ *T*-2)  (VP caught  (NP it)))))))  \end{verbatim}  Coordinated unlike clauses are dominated by S.  \begin{verbatim}  (S (S (NP-SBJ He)  (VP was  (VP performing  (PP for  (NP pay)))))  , and  (SBARQ (WHADVP-1 why)  (SQ should  (NP-SBJ anyone)  (VP expect  (NP (NP anything)  (ADJP more))  (ADVP-PRP *T*-1))))  ?)  \end{verbatim}  Note that coordination of unlike phrases (i.e., non-clauses) is dominated  by UCP (Unlike Coordinated Phrase). \index{UCP}  \item Coordinating conjunctions \index{conjunctions!coordinating}  \index{coordination}  Besides the usual {\it and, or, but}\/, etc., certain prepositions and  subordinating conjunctions can be used as coordinating conjunctions.  Multi-word coordinating conjunctions are labeled CONJP \index{CONJP} (see  \RefCoord).  %was I.  \begin{verbatim}  (S (NP-SBJ Willie  (CONJP as well as)  Casey)  (VP saw  (NP the ball)))  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ Casey)  (VP will  (VP throw  (NP the ball)  (NP-TMP tomorrow  (CONJP if not)  today))))  \end{verbatim}  Otherwise, these are annotated as PPs or SBARs, as appropriate. (See  \RefCoord.)  %was Coordination.I.B.)  \begin{verbatim}  (S (SBAR-ADV if  (FRAG not  (NP today)))  ,  (NP-SBJ Casey)  (VP will  (VP throw  (NP the ball)  (NP-TMP tomorrow))))  \end{verbatim}  \end{itemize}  \subsubsection{Subordination: the use of SBAR} \index{SBAR}  See \RefWHph\ and \RefSubord\ for more detail.  \begin{itemize}  \item Relative clauses are adjoined to the NP that they modify.  \index{relative clauses}  \begin{verbatim}  (S (NP-SBJ (NP The person)  (SBAR (WHNP-1 who)  (S (NP-SBJ *T*-1)  (VP caught  (NP the ball)))))  (VP is  (ADJP-PRD very athletic)))  \end{verbatim}  \item SBARs can be verbal complements.  \begin{verbatim}  (S (NP-SBJ Willie)  (VP knew  (SBAR that  (S (NP-SBJ Casey)  (VP hid  (NP the ball))))))  \end{verbatim}  \index{SBAR}  \item Conditional, temporal, and other such adverbial SBARs are  attached under either S or VP, depending on whether they precede or follow  the main clause, and given the appropriate adverbial function tag.  \begin{verbatim}  (S (SBAR-ADV if  (S (NP-SBJ Casey)  (VP throws  (NP the ball))))  (NP-SBJ Willie)  (VP catches  (NP it)))  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ Willie)  (VP catches  (NP the ball)  (SBAR-ADV if  (S (NP-SBJ Casey)  (VP throws  (NP it))))))  \end{verbatim}  \end{itemize}  \subsubsection{Fronted elements} \index{fronted constituents}  Fronted elements are those that appear before the subject in a declarative  sentence. They are placed inside the top clause level (e.g. S, SINV, SQ,  SBAR). (See the section on -TPC in \RefNotation\ for more details on the  use of the -TPC tag and the section on *T* with fronted elements in  \RefNulls\ for more details on the distribution of *T*.)  \begin{itemize}  \item Fronted arguments.  Fronted arguments are attached under the main clause level. They always  leave a *T* and are tagged -TPC. This holds whether the argument is  fronted within a single clause or crosses more than one clause boundary.  \begin{verbatim}  (S (NP-TPC-5 This)  (NP-SBJ every man)  (VP contains  (NP *T*-5)  (PP-LOC within  (NP him))))  \end{verbatim}  \item Fronted adjuncts. \index{fronted constituents} \index{TPC@-TPC}  Fronted adjuncts receive function tags (e.g, -ADV, -TMP) as appropriate.  However, note that they receive the -TPC label only in cases where they are  associated with a *T* in a lower clause. In particular, a *T* appears in  the annotation only if the adjunct is fronted over more than one clause  boundary:  \begin{verbatim}  (S (PP-TPC-1 Excluding  (NP (NP (NP an increase)  (PP-LOC in  (NP the tax rate)))  and  (NP (NP the effects)  (PP of  (NP foreign currency translations)))))  ,  (NP-SBJ Mr. Millis)  (VP said  (SBAR 0  (S (NP-SBJ (NP the company 's)  results)  ``  (VP were  (ADVP still)  (ADJP-PRD (NP-ADV a little)  disappointing)  (PP *T*-1)))))  . '')  \end{verbatim}  \begin{verbatim}  (S (SBAR-ADV-TPC-1 If  (S (NP-SBJ profits)  (VP do n't  (VP improve))))  ,  (NP-SBJ Mr. Whitten)  (VP says  (SBAR 0  (S (NP-SBJ he)  (VP may  (VP quit  (NP the exchange)  (SBAR-ADV *T*-1))))))  .)  \end{verbatim}  There is no *T* in the annotation if the adjunct is fronted within a single  clause:  \index{fronted constituents}  \begin{verbatim}  (S (NP-TMP Yesterday)  (NP-SBJ I)  (VP went  (PP-DIR to  (NP the store))))  \end{verbatim}  \begin{verbatim}  (S (S-ADV (NP-SBJ *-1)  (VP Running  (PP-DIR toward  (NP Casey))))  ,  (NP-SBJ-1 Willie)  (VP caught  (NP the ball)))  \end{verbatim}  \begin{verbatim}  (S (SBAR-TMP (WHADVP-7 When)  (S (NP-SBJ I)  (VP do n't  (VP get  (NP enough sleep)  (ADVP-TMP *T*-7)))))  (NP-SBJ-8 I)  (VP have  (NP (NP trouble)  (S-NOM (NP-SBJ *-8)  (VP staying  (ADJP-PRD awake))))))  \end{verbatim}  Elements that are fronted within questions (either {\it wh}\/- or yes/no)  are put inside the highest level of structure (SBARQ or SQ, respectively).  \index{questions}  \index{fronted constituents}  \begin{verbatim}  (SBARQ (SBAR-ADV If  (S (NP-SBJ Casey)  (VP throws  (NP the ball))))  (WHNP-1 who)  (SQ will  (NP-SBJ *T*-1)  (VP catch  (NP it)))  ?)  \end{verbatim}  \begin{verbatim}  (SQ (SBAR-ADV If  (S (NP-SBJ Casey)  (VP throws  (NP the ball))))  will  (NP-SBJ Willie)  (VP catch  (NP it))  ?)  \end{verbatim}  \end{itemize}  \subsubsection{Quotations} \index{quotations}  \begin{itemize}  \item Direct quotations \index{quotations!direct}  A direct quotation is considered to be the argument of the verb of saying.  \begin{verbatim}  (S (NP-SBJ Casey)  (VP said  ``  (S (NP-SBJ Willie)  (VP caught  (NP the ball))))  .  '')  \end{verbatim}  When the quotation appears before the quoting verb, it treated as a fronted  argument: the quote is attached at S level and given a -TPC tag, and a  trace is shown under the VP.  \begin{verbatim}  (S ``  (S-TPC-1 (NP-SBJ Willie)  (VP caught  (NP the ball)))  ,  ''  (NP-SBJ Casey)  (VP said  (S *T*-1))  .)  \end{verbatim}  \begin{verbatim}  (SINV ``  (S-TPC-1 (NP-SBJ Willie)  (VP caught  (NP the ball)))  ,  ''  (VP said  (S *T*-1))  (NP-SBJ Casey)  .)  \end{verbatim}  If the quotation is discontinuous, the interruptive material is annotated  as a parenthetical (-PRN). Note that a trace appears under the VP in the  parenthetical, but that the fronted portion is not labeled -TPC:  \begin{verbatim}  (S-7 ``  (NP-SBJ Willie)  ''  (PRN ,  (S (NP-SBJ Casey)  (VP said  (S *T*-7)))  ,)  ``  (VP caught  (NP the ball))  .  '')  \end{verbatim}  \item Indirect quotations \index{quotations!indirect}  The (SBAR 0) level marks indirect quotations and cases where only a portion  of the quote is direct.  \index{SBAR 0}  \begin{verbatim}  (S (NP-SBJ Stokely)  (VP says  (SBAR 0  (S (NP-SBJ stores)  (VP revive  (NP (NP specials)  (PP like  (NP (NP three cans)  (PP of  (NP peas))  (PP for  (NP 99 cents)))))))))  .)  \end{verbatim}  \begin{verbatim}  (S (NP-SBJ Mr. Millis)  (VP said  (SBAR 0  (S (NP-SBJ (NP the company 's)  results)  ``  (VP were  (ADVP still)  (ADJP-PRD (NP-ADV a little)  disappointing)))))  . '')  \end{verbatim}  \end{itemize}  \subsubsection{Gapping} \index{gapping}  See \RefCoord\ for more details on gapping.  \begin{itemize}  \item Intrasentential gapping \index{gapping!template}  \index{gapping!intrasentential}  When the gapped construction exists alongside a complete (ungapped) clause  of parallel structure, the complete clause is used as a template to which  elements in the gapped clause are mapped via the ``='' notation (referred  to as ``gap coindexing''). \index{gap coindexing}  \begin{verbatim}  (S (S (NP-SBJ-1 Mary)  (VP likes  (NP-2 Bach)))  and  (S (NP-SBJ=1 Susan)  ,  (NP=2 Beethoven)))  \end{verbatim}  In the above, the equal sign notation maps constituent NP=1 over NP-1 so  that the following predicate argument structure can be extracted: {\sc  like}\/ (Mary, Bach) and {\sc like}\/ (Susan, Beethoven). Similarly:  \begin{verbatim}  (S (NP-SBJ John)  (VP (VP gave => GIVE (John, Mary, book)  (NP-1 Mary)  (NP-2 a book))  and  (VP (NP=1 Bill) => GIVE (John, Bill, pencil)  (NP=2 a pencil))))  \end{verbatim}  All constituents in gapped constructions receive function tags as  appropriate.  \begin{verbatim}  (S (NP-SBJ I)  (VP (VP eat  (NP-1 breakfast)  (PP-TMP-2 in  (NP the morning)))  and  (VP (NP=1 lunch)  (PP-TMP=2 in  (NP the afternoon)))))  \end{verbatim}  \item Intersentential gapping \index{gapping!intersentential}  The template approach to gapping is not used across sentences. If a  template is not available within the same sentence, the gapped constituent  is labeled FRAG instead.  {\it What is Tim eating? Mary Ann thinks chocolate.}  \begin{verbatim}  (S (NP-SBJ Mary Ann)  (VP thinks  (SBAR 0  (FRAG (NP chocolate)))))  \end{verbatim}  \end{itemize}  %\printindex  %\end{document}