
Text analysis in R. Demo 2: Sentiment dictionaries 

Kasper Welbers

This demo is part of a short series of videos on text analysis in R, developed mainly for R introduction workshops.
A more detailed tutorial for the code discussed here can be found on our R course material GitHub page:
github.com/ccs-amsterdam/r-co...
Vignette for how to use corpustools:
cran.r-project.org/web/packag...

Published: 5 Oct 2020

Comments: 4
@callingwind5071 2 years ago
Hi Kasper, thanks for the video. Is it possible to have the sentiment analysis for French?
@kasperwelbers 2 years ago
Hi @Callingwind, you're welcome. You could use a French dictionary. I know there are translated versions of the NRC dictionary (though I don't know whether they are any good), and I saw that there is a French adaptation of the NRC called "FEEL" (French Expanded Emotion Lexicon). The main thing to watch out for is that you handle accented characters properly. More generally, if you want to do NLP with French, it's good to read up on character encoding (if you haven't already). Simply put, you'll want either to keep both your texts and your dictionary in Unicode (UTF-8), special characters included, or to transliterate both to ASCII, e.g. iconv("Hôtel-Résidence", to='ASCII//TRANSLIT').
As a side note, if you're looking into sentiment analysis for professional or academic application, you might want to consider a machine learning approach instead (transformers like BERT are killing it). The field of NLP has been moving fast, and dictionary-based sentiment analysis is on the way out.
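The encoding advice above can be sketched in base R. This is only an illustration under stated assumptions: the two texts and the three-word lexicon are invented toy data (not FEEL or NRC entries), and the helper names `to_utf8_lower`, `to_ascii`, `tokenize_fr`, and `score_text` are hypothetical. The point is that texts and dictionary go through the same normalisation before matching.

```r
# Toy data, invented for illustration; not taken from FEEL or NRC.
# \u escapes keep the script independent of the file's own encoding.
texts <- c(
  "L'h\u00f4tel \u00e9tait tr\u00e8s agr\u00e9able",       # L'hotel etait tres agreable (with accents)
  "Un s\u00e9jour d\u00e9cevant et d\u00e9sagr\u00e9able"  # Un sejour decevant et desagreable (with accents)
)
lexicon <- data.frame(
  word      = c("agr\u00e9able", "d\u00e9cevant", "d\u00e9sagr\u00e9able"),
  sentiment = c(1, -1, -1),
  stringsAsFactors = FALSE
)

# Option 1: keep the accents, but make sure both sides are UTF-8 and lower case.
to_utf8_lower <- function(x) tolower(enc2utf8(x))

# Option 2 (alternative, not used below): transliterate both sides to ASCII.
# The exact output of //TRANSLIT depends on the platform, so inspect it first.
to_ascii <- function(x) iconv(x, from = "UTF-8", to = "ASCII//TRANSLIT")

# Split on spaces, apostrophes and basic punctuation (ASCII-only pattern).
tokenize_fr <- function(x) strsplit(to_utf8_lower(x), "[ '.,;:!?]+")

# Sum the sentiment values of all tokens that occur in the lexicon.
score_text <- function(tokens, lexicon) {
  hits <- match(tokens, to_utf8_lower(lexicon$word))
  sum(lexicon$sentiment[hits], na.rm = TRUE)
}

scores <- vapply(tokenize_fr(texts), score_text, numeric(1), lexicon = lexicon)
print(scores)
```

If the dictionary were normalised differently from the texts (for example, accents stripped on one side only), words such as "agréable" would silently fail to match, which is the pitfall the comment warns about.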
@callingwind5071 2 years ago
@kasperwelbers Super interesting answer. Thanks a lot. I am looking to build a recession index based on newspapers in my country. Any other tips?
@kasperwelbers 2 years ago
@callingwind5071 If by recession index you mean measuring whether/how newspapers mention recession, or more generally bad economic tidings, then I would recommend not using a general sentiment dictionary/model. A general sentiment analysis only tells you whether a text is positive or negative, not what it is positive or negative about. While you could argue that sentiment in the context of business news is a decent proxy for recession, how good a proxy it is depends on how you define and operationalize recession. If you are very specifically interested in recession, you might even just use a small dictionary or search query to find explicit references to recession. You might then want to consider using Boolean queries instead of simple dictionary terms. Then you can also specify that words need to occur within a certain distance of each other (e.g., crash and market). This is possible with corpustools: cran.r-project.org/web/packages/corpustools/vignettes/corpustools.html#search_features
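To make the idea of a Boolean query with a distance condition concrete, here is a minimal base R sketch. It is not the corpustools query syntax (see the vignette linked above for that); the helper `near`, the headlines, and the window of 3 tokens are all assumptions made up for illustration. The query it implements is: "recession" OR ("crash" within 3 tokens of "market").

```r
# Hypothetical helper, not part of corpustools: TRUE if words `a` and `b`
# occur within `window` tokens of each other in a token vector.
near <- function(tokens, a, b, window = 5) {
  pos_a <- which(tokens == a)
  pos_b <- which(tokens == b)
  if (length(pos_a) == 0 || length(pos_b) == 0) return(FALSE)
  any(abs(outer(pos_a, pos_b, "-")) <= window)
}

# Invented example headlines.
headlines <- c(
  "Economists warn of a recession next year",
  "Stock market crash wipes out savings",
  "Car crash closes the road to the old flower market",
  "Local team wins the championship"
)

# Lower-case and split on anything that is not an ASCII letter.
tokenize_ascii <- function(x) strsplit(tolower(x), "[^a-z]+")

# Boolean query: recession OR (crash NEAR market).
matches_query <- function(tokens) {
  "recession" %in% tokens || near(tokens, "crash", "market", window = 3)
}

hits <- vapply(tokenize_ascii(headlines), matches_query, logical(1))
print(data.frame(headline = headlines, hit = hits))
```

The third headline contains both "crash" and "market" but too far apart, so a plain dictionary lookup would count it while the proximity condition does not.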