Sentiment Analysis and Data visualization Techniques of Experiments

Slides:



Advertisements
Similar presentations
To achieve a Level 7 you need to…. To achieve a Level 6 you need to…
Advertisements

Problem Solving in Science The Scientific Method.
Carrying Out an Investigation in Science
Science Fair Project 2015.
Prelude to the Research Validity Lecture A RH: is a guess about the relationships between behaviors In order to test our RH: we have to decide on a research.
Meta-Chart is a free app that allows you to design and communicate information visually. The.
Objective Students will be able to identify important elements within a nonfiction text Standards: – W9: Draw evidence from literary or informational texts.
SCIENTIFIC METHOD. 1.1 Observe. It is curiosity that breeds new knowledge. The process of observation, sometimes called "defining the question," is simple.
The Scientific Method. The Scientific Method The Scientific Method is a problem solving-strategy. *It is just a series of steps that can be used to solve.
Big Idea 1: The Practice of Science Description A: Scientific inquiry is a multifaceted activity; the processes of science include the formulation of scientifically.
Scientific Inquiry & Skills
Mohawk Junior High Showcase of Learning What is a Science Project?
What Do Scientists Do? Quiz 1C.
Chapter Three: The Scientific Process  3.1 Inquiry and the Scientific Method  3.2 Experiments and Variables  3.3 The Nature of Science and Technology.
The Scientific Method aka: Scientific Inquiry. What is Science? The goal of science is to investigate and understand the natural world, to explain events.
Scientific Method Review.  The scientific method is used by scientists to solve problems  It is organized and reproducible (can be repeated by other.
 Set up is January 7, 2016  Judging is January 8, 2016.
CALIFORNIA WINE CLUB WWW. JAVICELLARS.COM. CALIFORNIA WINE CLUB.
Scientific Method.
Mathematical Derivation of Probability
Madhu Karra1, Jason Koglin1,2
AP CSP: Cleaning Data & Creating Summary Tables
Image Processing For Soft X-Ray Self-Seeding
Characterizing Optics with White-Light Interferometry
Matter in Extreme Conditions at LCLS:
Crystallography images
Hutch 4 (XCS) Laser System
Anomaly Detection LCLS data: Introduction Research Conclusion
MAKING WAY FOR LCLS II HERRIOTT CELL IGNIS THE NEED FOR LCLS-II
Paper Airplanes & Scientific Methods
EASE: Alert Management
Designing and Prototyping Optics Table and Shelves
Managing Diffraction Beam
Integration of Blu-Ice into
3.1 Inquiry and the Scientific Method
Effects of Wrapping Vacuum
LCLS CXI Sample Chambers and Engineering Solutions
Values to Normalize Data
LAMP Vacuum System Documentation
Your Name, Elias Catalano, Robert Sublett, Robin Curtis
Determining a Detector’s Saturation Threshold
HXRSD Helium Enclosure
Crystallography Labeling Using Machine Learning
IT Inventory and Optimization
Research Methodologies, Realities and Funding Sources: A Perspective
Creation and development of a PRP database
IT Inventory and Optimization
Electro Optic Sampling of Ultrashort Mid-IR/THz Pulses
Margaret Koulikova1, William Colocho+
Scientific Method.
Do now: Write down 3 things that you think involves biology
Repeatability Test of Attocube Piezo Stages
Analytical Solutions for Temperature
Barron Montgomery1,2, Christopher O’Grady2, Chuck Yoon2, Mark Hunter2+
Good/Bad, Happy/Sad conducting sentiment analysis on user survey data from Houghton Library with R.
Elizabeth Van Campen, Barrack Obama2, Stephen Curry1,2, Steve Kerr2+
Advanced Methods of Klystron Phasing
Systematic Investigation: The Scientific Method
Ryan Dudschus, Alex Wallace, Zachary Lentz
The Scientific Method C1L1CP1 How do scientists work?
Jennifer Hoover1, William Colocho2
Steps of answering a scientific question
Redesigning of The SED Calendars
Automatic compression control for the LCLS injector laser
Russell Edmonds1, Matt Hayes2+ 1LCLS Summer Research Intern
Title of Your Poster Introduction Research Conclusions Acknowledgments
Reconstruction algorithms Challenges & Future work
Chapter Three: The Scientific Process
Scientific Method.
Presentation transcript:

Sentiment Analysis and Data visualization Techniques of Experiments Spencer Rodrigues1,, Nikola Stojanovic 2, Elizabeth Goodwin2, Joe Robinson2 1Statistics Department, California Polytechnic State University, 1 Grande Ave, San Luis Obispo, CA 93407, USA 2Linac Coherent Light Source, SLAC National Accelerator Laboratory, 2575 Sand Hill Road, Menlo Park, CA 94025, USA contacts: stojanik@slac.Stanford.edu egoodwin@slac.Stanford.edu jsrob@slac.Stanford.edu Background First Conclusion Techniques of Experiments With all the experiments that are run at SLAC, I thought it would be interesting to see which experimental techniques were used most often. Sentiment analysis is the process of computationally identifying and categorizing opinions expressed in a piece of text, especially in order to determine whether the writer's attitude towards a particular topic, product, etc., is positive, negative, or neutral. The downside with this analysis is that it gets approximately 80% of its classifications correct, which means we could miss out on some important reviews. That means that we still need someone to glance over the reviews before the meetings, but it will happen much more quickly than before thanks to this program Problem Statement Data investigation This project addresses the need for quick and accurate analysis of user’s reviews from their time in LCLS. This is necessary because we ask them many questions and when we receive their responses we want to be able to pick out which responses are negative and discuss how we can improve in those areas. By having a program run through the responses that we get we will also be saving a lot of time. The other part to my summer project was to investigate some data that I was given. All of the variables were categorical, which really limited the amount of analysis that I could do on them; however, I was able to look into the data and make some visualizations of relationships that I found interesting. The dataset that I had access to had over seventy thousand observations, and for each observation we had 30 variables that went with it. The variables are: Country, Citizenship, Department, Eligible For Beamtime, Ethnicity, Facility, Facts Sensitivity, Funding Source, Institution Code, Institutions Code, Instrument, Run, Person Id, Person Type, Proposal Id, Research Area, Sex, Small Business, State, Submission, Cycle, Submission Cycle, Technique, Time Submitted, Year Of Birth, and Zip code. These were all collected from every experiment from every run at LCLS. Figures 2: Pie chart of techniques and number of times each technique was used Research Areas There are also many different areas of research that are done at SLAC. Over all the runs at SLAC in LCLS the areas of research and their amounts are in the plot below. Program This was all achieved by using the statistical software called R. We had to download several different packages to be able to achieve our goal. The packages that were downloaded were: sentiment, tm, NLP, and Rstem. The function that we used is called Qdap and takes the string as an input. The output that we are given rates the text as positive, negative, or neutral. It also gives a polarity score, which determines the strength of the rating as mentioned above. If we cared about the emotions portrayed in the review we could also apply a different function that has the capability of returning the main emotion portrayed in the review, this function is called classify_emotion. Frequent User Countries I thought it would be interesting to investigate some of the variables that went along with the countries that have used the SLAC LCLS facilities more than one hundred times. The first important plot that I thought was necessary when looking into the frequent User countries was how many experiments have actually been run by each country while they were at SLAC. Figure 3: Pie chart of research areas and number of times each area was researched at LCLS Second Conclusion With 30 variables and seventy thousand observations there are countless possibilities for investigation, so I decided to post the three that I found most intriguing. With all the other variable associations that I investigated and to the many I didn’t, I’m; sure there are many new things that we can find out about SLAC to help decide where they can focus their advertisements to other countries and other institutions in the United States, to help further the scientific discoveries at our great facilities. Figure 1: Bar graph of number of experiments associated with the country of the User group Date: 09/07/2016