What are effective features to identify bullying messages?
We will explore particular features of tweet text, as well as tweet metadata, to find features that automatic analysis might miss.
Are these features useful for selecting training data from Twitter?
Only a small proportion of all messages on Twitter is bullying. If we can filter the data from Twitter before analyzing it, we may retrieve more examples of bullying, which would aid classification.
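As a rough illustration of such a pre-filter, the sketch below keeps only tweets that contain at least one seed term. The seed terms, the `prefilter` and `build_pattern` helpers, and the dictionary layout of a tweet are all assumptions made for this example; they are not taken from any particular dataset or from the Twitter API.

```python
import re

# Hypothetical seed terms; a real list would come from annotated data or prior work.
SEED_TERMS = {"loser", "ugly", "stupid", "nobody likes you"}

def build_pattern(terms):
    """Compile one case-insensitive pattern matching any term as a whole word or phrase."""
    # Longer terms first, so multi-word phrases are tried before single words.
    escaped = sorted((re.escape(t) for t in terms), key=len, reverse=True)
    return re.compile(r"\b(?:" + "|".join(escaped) + r")\b", re.IGNORECASE)

def prefilter(tweets, pattern):
    """Keep only tweets whose text contains at least one seed term."""
    return [t for t in tweets if pattern.search(t["text"])]

# Invented example tweets, represented as plain dictionaries.
tweets = [
    {"id": 1, "text": "Great game last night!"},
    {"id": 2, "text": "You are such a LOSER, nobody likes you"},
    {"id": 3, "text": "Stupid traffic made me late"},
]

pattern = build_pattern(SEED_TERMS)
candidates = prefilter(tweets, pattern)
print([t["id"] for t in candidates])  # prints [2, 3]
```

Tweet 3 passes the filter although it is not bullying. A filter like this only raises the share of candidate messages; it does not label them, and it biases the sample towards messages that contain the chosen terms.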
What are good classifier algorithms and parameters for this task?
There are many different classification methods. We can compare them to find the most suitable one.
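A comparison of this kind is usually done by evaluating every method on the same data splits. The sketch below shows one possible setup using k-fold cross-validation. The two "classifiers" (a majority-class baseline and a simple keyword rule) are stand-ins chosen to keep the example self-contained, not the methods that would actually be compared. The toy data is invented, and scoring by F1 on the bullying class is an assumption, made because accuracy is misleading when one class is rare.

```python
from collections import Counter

def k_folds(n_items, k):
    """Yield (train_indices, test_indices) for k contiguous folds."""
    fold_size, remainder = divmod(n_items, k)
    start = 0
    for fold in range(k):
        stop = start + fold_size + (1 if fold < remainder else 0)
        test = list(range(start, stop))
        train = [i for i in range(n_items) if i < start or i >= stop]
        yield train, test
        start = stop

def f1_score(gold, predicted, positive=1):
    """F1 for the positive class; 0.0 when there are no true positives."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def train_majority(texts, labels):
    """Stand-in baseline: always predict the most frequent training label."""
    majority = Counter(labels).most_common(1)[0][0]
    return lambda text: majority

def train_keyword(texts, labels):
    """Stand-in rule: flag texts containing a word seen only in positive examples."""
    pos_words, neg_words = set(), set()
    for text, label in zip(texts, labels):
        (pos_words if label == 1 else neg_words).update(text.lower().split())
    cues = pos_words - neg_words
    return lambda text: 1 if cues & set(text.lower().split()) else 0

def cross_validate(trainer, texts, labels, k=3):
    """Mean F1 of a trainer over k folds; every trainer sees the same splits."""
    scores = []
    for train_idx, test_idx in k_folds(len(texts), k):
        model = trainer([texts[i] for i in train_idx], [labels[i] for i in train_idx])
        predicted = [model(texts[i]) for i in test_idx]
        scores.append(f1_score([labels[i] for i in test_idx], predicted))
    return sum(scores) / len(scores)

# Invented toy data: 1 = bullying, 0 = not bullying.
texts = [
    "you are a loser", "nice weather today", "see you at lunch",
    "what a loser you are", "good luck on the exam", "great game last night",
    "nobody likes you loser", "lunch was great today", "the exam went fine",
]
labels = [1, 0, 0, 1, 0, 0, 1, 0, 0]

for name, trainer in [("majority", train_majority), ("keyword", train_keyword)]:
    print(name, round(cross_validate(trainer, texts, labels), 3))
```

Because every method is trained and tested on identical folds, differences in the scores reflect the methods rather than the split. The same harness can be used to compare parameter settings of a single method.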