ÌÇÐÄÊÓÆµ

November 2, 2016

Make America tweet again

A screenshot from estorm.org showing Twitter sentiment toward or against the major presidential candidates. Credit: estorm.org
× close
A screenshot from estorm.org showing Twitter sentiment toward or against the major presidential candidates. Credit: estorm.org

We know how Donald Trump feels about everyone through Twitter, but how do Twitter users feel about Donald Trump?

Computer scientists from the University of Utah's College of Engineering have developed what they call "sentiment analysis" software that can automatically determine how someone feels based on what they write or say. To test out the accuracy of this software's machine-learning model, the team used it to analyze the individual sentiments of more than 1.6 million (and counting) geo-tagged about the U.S. presidential election over the last five months. A database of these tweets is then examined to determine whether states and their counties are leaning toward the Republicans or Democrats.

"With sentiment analysis, it will try to predict the emotions behind every human being when he or she is talking or writing something," says Debjyoti Paul, a doctoral student in the University of Utah's School of Computing and the project leader along with School of Computing associate professor Feifei Li. "With that in mind, we are not just trying to look at the information in the tweets. We are trying to incorporate the emotion with the information."

As a result of their work, the team has created an interactive website at in which users can find out if the tweets coming out of their state and its counties are more positive or negative toward Republicans or Democrats during any defined period of time since June 5. Also, the data can tell you the percentage of both positive and negative tweets toward a political party and when there was a surge for a particular type of tweet in the last five months.

Get free science updates with Science X Daily and Weekly Newsletters — to customize your preferences!

Some interesting facts about this year's U.S. presidential election based on a sample of what people are tweeting:

Analyzing the tweets

Paul and his team started with more than 250 million tweets posted around the world from June 5 to Oct. 30 and then weeded out all non-political tweets based on a system of keywords using advanced software. They were left with more than 1.6 million political tweets posted in the U.S.

Then those tweets were sifted through the team's "sentiment analysis" software where each tweet was analyzed and assigned a score from 0 to 1 where 0 is the most negative sentiment, 1 is the most positive sentiment, and 0.5 is neutral. The scores are then collected in a database that can calculate a state or county's political leanings in real time based on the tweets. The database is constantly updated with new tweets.

To measure the accuracy of the model, the team compared its results to the and found the state-by-state analysis was very similar.

"I think it works really well. It matches up with the major events that happened during this election season. That's a good indicator that the results are accurate," says Li. "We're hoping to develop some more scientific measurements to confirm this observation for an upcoming paper, but the early results are very positive."

Paul believes that their sentiment analysis software could be used to more accurately reflect the feelings of crowd-sourced opinions on the Internet, for example reviews of products on Amazon or restaurant reviews on Yelp, in which the software can "drill down to the individual sentences of the text" to determine a person's true feelings about something, he says.

He also said that voice-enabled assistants such as iPhone's Siri could use such software to better determine what the user wants, not just based on what he or she says but how they say it.

Provided by University of Utah

Load comments (0)

This article has been reviewed according to Science X's and . have highlighted the following attributes while ensuring the content's credibility:

Get Instant Summarized Text (GIST)

This summary was automatically generated using LLM.