Certainty
agshruti12 edited this page Dec 6, 2023
Given a certainty word list (lexicon) of words and phrases, each with an associated certainty score:
- Sort the items in the lexicon from longest to shortest: first by number of words, then by number of characters, so that longer phrases are matched before the shorter words they contain.
- For each lexicon word or phrase found in the message, assign it the weight given in the "Certainty" column. If a word or phrase is repeated, "replace" every instance with that value.
- Calculate the average: sum all Certainty scores and divide by the number of scores. This is the final "Certainty" value.

Note: If there are no matches between the message and the certainty lexicon, assign a default certainty score of 4.5. Certainty scores range from 0 to 9, with 0 representing uncertain and 9 representing very certain, so we've assumed 4.5 (the midpoint) to be a rough representation of a general statement lacking a certainty component.
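The steps above can be sketched in Python roughly as follows. This is a minimal illustration, not the TPM implementation: the function names, the lowercasing of the message, the whole-word matching rule, and the tiny `example_lexicon` (with made-up scores) are all assumptions introduced here for demonstration. It also assumes each repeated instance contributes its own score to the average.

```python
import re

DEFAULT_CERTAINTY = 4.5  # midpoint of the 0-9 scale, used when nothing matches


def sort_lexicon(lexicon):
    """Order (phrase, score) pairs longest first: by word count, then by character count."""
    return sorted(
        lexicon.items(),
        key=lambda item: (len(item[0].split()), len(item[0])),
        reverse=True,
    )


def certainty_score(message, lexicon):
    """Average the scores of all lexicon matches in the message (hypothetical helper)."""
    text = message.lower()  # assumption: matching is case-insensitive
    scores = []
    for phrase, score in sort_lexicon(lexicon):
        # Assumption: match whole words/phrases only, not substrings of longer words.
        pattern = r"(?<!\w)" + re.escape(phrase.lower()) + r"(?!\w)"
        # Replace every instance with a placeholder so that shorter entries
        # cannot match again inside a phrase that was already consumed.
        text, count = re.subn(pattern, " \x00 ", text)
        scores.extend([score] * count)
    if not scores:
        return DEFAULT_CERTAINTY
    return sum(scores) / len(scores)


# Hypothetical lexicon; the real word list and its scores are not reproduced here.
example_lexicon = {"not sure": 1.0, "sure": 8.0, "definitely": 9.0}

print(certainty_score("I am not sure", example_lexicon))
print(certainty_score("Sure, I am definitely sure", example_lexicon))
print(certainty_score("Hello there", example_lexicon))
```

In the first call, sorting longest-first means "not sure" is consumed before "sure" is tried, so the message is scored by the phrase rather than by the single word inside it. The last call has no matches and falls back to the default of 4.5.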
Validation technique: ensured equivalence between the TPM certainty outputs and the Lexical Suite certainty outputs. Also added a default score of 4.5 for "neutral" statements, i.e. those with zero matches against the certainty lexicon.