You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: documentation/under-the-hood/ranking-notes.md
+15-4
Original file line number
Diff line number
Diff line change
@@ -99,11 +99,11 @@ We currently assign notes a "Not Helpful" status if the max (upper confidence bo
99
99
**Supervised confidence modeling**
100
100
101
101
We also employ a supervised model to detect low confidence matrix factorization results.
102
-
If the model predicts that a note will lose Helpful status, then the note will remain in Needs More Ratings status for up to an additional 180 minutes to allow it to gather a larger set of ratings.
102
+
If the model predicts that a note will lose Helpful status, then the note will remain in Needs More Ratings status for up to an additional 180 minutes or until the supervised model predicts the note will remain rated Helpful.
103
+
The additional time in Needs More Ratings status allows the note to gather a larger set of ratings.
103
104
If after that time the note still meets Helpful standards based on the matrix factorization scoring, the note will be rated Helpful and shown on X.
104
105
In all cases, the final status of the note is determined by matrix factorization.
105
-
The maximum effect of the supervised model is no more than a 180 minute delay.
106
-
All notes will receive a 30 minute delay to gather additional ratings.
106
+
The maximum effect of the supervised model is no more than a 180 minute delay, and all notes will receive a minimum 30 minute delay to gather additional ratings.
107
107
This helps reduce notes briefly showing and then returning to Needs More Rating status.
108
108
109
109
The training data for the supervised confidence model includes all notes that meet the criteria for Helpful status _at some point in time_.
@@ -113,8 +113,16 @@ The features employed by the model include:
113
113
- Aggregate ratios of helpful and non-helpful tags across all ratings for a note
114
114
- Statistics summarizing the Helpful ratings for a note (e.g. standard deviation of user factors from Helpful ratings)
115
115
- Bucket counts of Helpful, Somewhat Helpful and Not Helpful ratings, partitioned by user factor $f_u$ as positive ($f_u >.3$), neutral ($-.3 \leq f_u \leq .3$) and negative ($f_u <-.3$)
116
+
- Burstiness of ratings associated with the note
116
117
117
-
The model uses logistic regression to predict note status outcomes, and is calibrated to delay Helpful status for no more than 60% of notes that ultimately stabilize to Helpful status.
118
+
Since ratings are often a reflection of both a note _and the associated post_, we also include ratings from _other notes on the same post_.
119
+
We refer to other notes on the same post as _peer notes_.
120
+
For example, if a note already has many peer notes that have been frequently assigned the Note Not Needed tag, the tag may reflect the associated post and all proposed notes may experience similar rating outcomes.
121
+
During feature extraction, we vectorize features for peer notes suggesting the post is misleading and not-misleading separately since the meaning of the rating changes depending on agreement between the note and the post.
122
+
123
+
The model uses logistic regression to predict note status outcomes.
124
+
The feature vectorization process involves both discretization and feature crosses, yielding a sparse representation and allowing the model to learn non-linear relationships.
125
+
The model is calibrated to delay Helpful status for no more than 60% of notes that ultimately stabilize to Helpful status.
118
126
119
127
## Tag Outlier Filtering
120
128
@@ -358,6 +366,9 @@ For not-helpful notes:
358
366
359
367
## What’s New?
360
368
369
+
**Mar 18, 2025**
370
+
- Update supervised confidence modeling to include features derived from ratings and scoring outcomes for peer notes located on the same post, as well as rating burstiness features.
371
+
361
372
**Mar 17, 2025**
362
373
- Update topic modeling to give a note more time to gather ratings if it is assigned to a topic, meets Currently Rated Helpful (CRH) criteria, but the topic model does not yet have enough ratings to be confident.
0 commit comments