Thomas Aglassinger - Introduction to sentiment analysis with spaCy

Conference: EuroPython 2018

Year: 2018

Introduction to sentiment analysis with spaCy [EuroPython 2018 - Talk - 2018-07-26 - PyCharm [PyData]] [Edinburgh, UK]

By Thomas Aglassinger

Sentiment analysis aims at extracting opinions from texts written in natural language, typically reviews or comments on social sites and forums. spaCy already provides mechanisms for dealing with natural languages in general but does not offer means for sentiment analysis.

This talk gives a short introduction to sentiment analysis in general and shows how to extract topics and ratings by utilizing spaCy's basic tools and extending them with a lexicon-based approach and simple Python code to consolidate sentiments spread over multiple words.

Topics covered are:

- What is sentiment analysis?
- Levels of sentiment detection
- Representing opinions
- Splitting texts into sentences and words
- Finding the base word (lemma)
- Extending spaCy's pipeline and tokens
- Matching words to topics and ratings
- Combining multiple words into a rating

Code examples are introduced and explained using a Jupyter notebook that can be used as a basis for your own analysis.

As an additional twist, the analyzed texts are not in English but in German, to show that this approach can be used for multiple languages. No knowledge of German is required, though, because translations of the short example sentences are provided.

Author's note: This is an extended version of a talk I gave at the PyDays Vienna 2018. The original slides and Jupyter notebook are available at https://github.com/roskakori/talks/tree/master/pydays/analyzingnaturallanguagefeedbackusing_python.

License: This video is licensed under the CC BY-NC-SA 3.0 license: https://creativecommons.org/licenses/by-nc-sa/3.0/

Please see our speaker release agreement for details: https://ep2018.europython.eu/en/speaker-release-agreement/
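The lexicon-based idea from the abstract (match base words to topics and ratings, then combine several words into one rating) can be sketched roughly as follows. This is a minimal illustration and not the code from the talk's notebook: it uses only the Python standard library, and the lexicon entries, the -3 to 3 rating scale, the function names, and the hand-written lowercase lemmas are all assumptions made for this example.

```python
from typing import List, Optional, Tuple

# Hypothetical mini lexicon of German lemmas (illustrative, not from the talk).
TOPICS = {"suppe": "food", "schnitzel": "food", "kellner": "service"}
RATINGS = {"lecker": 2, "gut": 2, "schlecht": -2, "unfreundlich": -2}
INTENSIFIERS = {"sehr", "wirklich"}
NEGATIONS = {"nicht", "kein"}
MAX_RATING = 3  # assumed scale: -3 (very bad) to 3 (very good)


def intensify(rating: int) -> int:
    """Move a non-zero rating one step away from zero, capped at the scale limit."""
    if rating > 0:
        return min(rating + 1, MAX_RATING)
    return max(rating - 1, -MAX_RATING)


def analyze(lemmas: List[str]) -> Tuple[Optional[str], Optional[int]]:
    """Return the first topic and the first consolidated rating found in a sentence."""
    topic: Optional[str] = None
    rating: Optional[int] = None
    pending: List[str] = []  # modifiers waiting for the rating word that follows them
    for lemma in lemmas:
        word = lemma.lower()
        if word in INTENSIFIERS or word in NEGATIONS:
            pending.append(word)
            continue
        if word in RATINGS:
            value = RATINGS[word]
            # Apply the modifier closest to the rating word first.
            for modifier in reversed(pending):
                if modifier in INTENSIFIERS:
                    value = intensify(value)
                else:
                    value = -value  # simplistic negation: flip the sign
            if rating is None:
                rating = value
        elif word in TOPICS and topic is None:
            topic = TOPICS[word]
        # Modifiers only affect a rating word that follows them directly.
        pending.clear()
    return topic, rating


# "Die Suppe war sehr lecker." (The soup was very tasty.)
print(analyze(["der", "suppe", "sein", "sehr", "lecker"]))
# "Der Kellner war nicht gut." (The waiter was not good.)
print(analyze(["der", "kellner", "sein", "nicht", "gut"]))
```

In a spaCy-based version, the lemmas would typically come from `token.lemma_` instead of a hand-written list, and the topic and rating could be stored on tokens through custom attributes registered with `Token.set_extension`. Flipping the sign for negations is a deliberate simplification; a real lexicon would likely map negated ratings more carefully.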