FILE : wsj_002.txt

-a---   (NP (CD 61) (NNS years) )
-the-a-   (NP (DT the) (NN board) )
-a---   (PP-CLR (IN as)
-a---    (NP (DT a) (JJ nonexecutive) (NN director) ) )
( (S
-a---   (NP (NN chairman) )
---the-     (NP (DT the) (NNP Dutch) (VBG publishing (NN group) )))))
( (S
-a---    (NP (CD 55) (NNS years) ) 
-a---   (CC and )
-a---     (NP (JJ former) (NN chairman) )
-a---      (NP (NNP Consolidated) (NNP Gold) (NNP Fields) (NNP PLC) ))))
-a--- (VP (VBD was) 
-a---  (VP (VBN named)
   (S
-a---     (NP (DT a) (JJ nonexecutive) (NN director) )
-a---      (NP (DT this) (JJ British) (JJ industrial) (NN conglomerate) ))))))
( (S
-a---    (NP (NN asbestos) ) ) )
-a---       (VP (VB make )
-a---        (NP (NNP Kent) (NN cigarette) (NNS filters) ) ) ) ) ) ) )
-a--- (VP (VBZ has )
-a---  (VP (VBN caused)
-a---    ( NP ( DT a ) ( 3J high ) ( NN percentage )
-a---     (NP (NN cancer) (NNS deaths) ) )
-a---    ( PP-LOC ( IN among)
-a---      (NP (DT a) (NN group) )
-a---            ( QP ( RBR more ) ( IN tilan ) ( CD 3 0 )
-a---            (NNS years )
-a---           ( IN ago) ) ) ) ) ) ) ) ) ) ) )
-a--- (NP -SBJ (NNS researchers )
   (S (-NONE- *T*-1.) ) ) )
( (S
-a---   (NP (DT The) (NN asbestos) (NN f. ber)
-a---   (ADJP-PRD (RB unusually) (JJ resilient) )
    (S
---the-      (NP (DT the) (NNS lungs) ) ) ) )
-a---     (VP (VBG causing)
-a---        (WHNP-1 (WDT that )
        (S 
-a---           (NP (NNS decades )
-a---           (JJ later) ) ) ) ) ) ) ) ) ) )
-a--- (NP-SBJ (NNS researchers )
-a--- (VP (VBD said)
   (S (-NONE- *T*-2) ) ) )
( (S
-a---  (NP (NNP Lorillard) (NNP Inc. )
---the-   (NP (DT the) (NN unit) )
-a---     ( ADJP ( JJ New ) ( JJ York-based )
-a---     (WHNP-2 (WDT that )
     (S 
-a---      (VP (VBZ makes)
-a---       (NP (NNP Kent) (NNS cigarettes) ) ) ) ) )
-a---    (NP (PRP$ its) (NN Micronite) (NN ci~arette) (NNS filters) ) )

In the file wsj_002.txt

Nb of sentences : 11
 
 nb of string (a) : 45
 nb of words by itself ( a ) : 4 
 nb of string beginning a word (a...) : 6 
 nb of string finishing a word (...a) : 0
 nb of string inside a word (...a...) : 34 

In the file wsj_002.txt

Nb of sentences : 11
 
 nb of string (the) : 4
 nb of words by itself ( the ) : 4 
 nb of string beginning a word (the...) : 0 
 nb of string finishing a word (...the) : 0
 nb of string inside a word (...the...) : 0 

    Source: geocities.com/gaelmail