-
Notifications
You must be signed in to change notification settings - Fork 6
E.24 Positivity (and Positivity z‐score)
E.24 Positivity
E.24, https://www.cs.cmu.edu/~ylataus/files/TausczikPennebaker2013.pdf
- Use BERT to compute a positivity score
- Compute the z score (of the BERT positivity score)
- Determine a threshold z score (We could do mean i.e the center and anything above the mean)
- Any sentence with a z-score above the threshold value is considered positive.
None
None
Z-score
Refer point 6.
Paper E.24 and relevant citations do not contain exact details about how/ from where to get positive words. We would ideally have to label words as positive and then implement the code.
A previous version of this feature used the LIWC lexicons for positivity, also by Pennebaker.
List of Stop words - English Stopwords in NLTK Original source.
Due to greater reliability of measuring positive features, we shifted to a BERT-based approach rather than a lexical-based approach, and we use the BERT positivity scores to compute the z-scores. However, we retain the measurement of the positivity lexicons in our LIWC-based features (included as separate columns).