Objectivity of the Aleksandr Sinayev PhD Candidate, Quantitative Psychology Ohio State University
About Me Quantitative psychologist Personally interested in applying statistical models to gain insight in any area
What Is Objectivity? Traditional objectivity (report the truth) –Content Pragmatic objectivity – Ward, 1999 – (reports are empirically valid and coherent) –Content
What Is Objectivity? Objectivity as pretense (Tuchman, 1972) –Form (and relationships)
Empirical Investigation Can the form and content of objectivity be reliably measured and compared across content areas?
How Can We Say an Article is Subjective? Could identify subjective elements according to the definition and prior work
How Can We Say an Article is Subjective? Automatized approach A lot of data are available on movies
The Data 2,000 reviews –see Pang, Lee, & Vaithyanathan, 2002 –1,000 positive reviews and 1,000 negative 2,000 synopses –See Bamman, O’Connor, & Smith, 2013 All New York Times articles available online
Preprocessing Common non-diagnostic words deleted –E.g., ‘a’, ‘on’ Numbers were changed to generic features –‘1,023’ => ‘4digitnumber’ –‘4.8’ => ‘singledigitwdecimal’ –Effort made to identify years, dates etc. Features were words and word bigrams Articles were units of analysis
The Classifier Naïve Bayes Trained on 1,500 reviews and 1,500 synopses Tested on 500 reviews and 500 synopses
Examples “I thought”“you will”“is certainly”“were hurt” Synopses ,113 Reviews9851,7571,265400
Did it Work? Classified 80% of the reviews and synopses it was tested on correctly. Most (82%) of the NYT articles were classified as certainly subjective Opinion pieces and editorials were almost always classified as subjective (receiving an average probability of.01 on objectivity) Political news articles averaged.15
Articles Remained Subjective over Time Local, national and international news
What else? Business articles tended to be quite objective (.12) Science and technology articles were more subjective (.01,.02)
What About Front Page? Similar results if counting numbers
Positive or Negative? Trained another algorithm like the one above to distinguish between positive and negative reviews, again achieving over 80% accuracy.
Opinion articles became more positive over time
Other Articles Did Not
Conclusions Objectivity appears to be measurable through simple word pairs News articles appear to concentrate on positive subjective judgments, at least inasmuch as they resemble positive reviews Positivity of articles across time appears to have little to do with positivity of the world across time –Dipped in the 90’s (also small dip in subjectivity)
Limitations Emphasis on form, relationships completely ignored, content partly ignored Objectivity harder to pin down than subjectivity Absolute values of numbers to be taken with a grain of salt
Final Remarks If you are interested and know the literature, help me write this up!