Mircea Trifan edited Introduction.tex  about 10 years ago

Commit id: 18731e8541535b759c8c11f0b21a72d8e31e414c

deletions | additions      

       

For a corpus of existing tweets, Twitter2011 or 2012 TREC corpus can be used.  Other ideas: As in \cite{Costa_2010} represent streaming phrases as a sum of individual keyword terms as sinusoidal curves and study a feedback loop on keywords for streaming api: for a set of trending keywords identify a new set that is used in the next iteration to feed the system; semantic fields; complex event processing (cep); identify text patterns between entities; unified search for entities; cascading in M3Data (Lingual) + ML (PMML); OLAP cube like operations; UIMA; biginsights; deep, dark web; geo data; combine M3Data security with Accumulo's cell based security; detection theory applied on streams. stream; Networks and SDN.