Mircea Trifan edited Introduction.tex  about 10 years ago

Commit id: 7f051b7edea27294381ad85690e90e76fe0e0b9d

deletions | additions      

       

For a corpus of existing tweets, Twitter2011 or 2012 TREC corpus can be used.  Other ideas: As in \cite{Costa_2010} represent streaming phrases as a sum of individual keyword terms as sinusoidal curves and study a feedback loop on keywords for streaming api: for a set of trending keywords identify a new set that is used in the next iteration to feed the system; semantic fields; complex event processing (cep); identify text patterns between entities; unified search for entities; cascading in M3Data (Lingual) + ML (PMML); OLAP cube like operations; UIMA; biginsights; deep, dark web; geo data; combine M3Data security with Accumulo's cell based security. security; detection theory applied on streams.