Open in app

Sign In

Write

Sign In

Petr Korab
Petr Korab

711 Followers

Home

Lists

About

Published in

MLearning.ai

·Pinned

My articles, lectures and papers

Index of my articles and lectures that will help you find your way On my blog, you find: my articles on medium and in other professional periodicals my recent research papers my lecture slides and teaching materials — my current affiliation is Zeppelin University in Friedrichshafen. My fields of interest…

Text Mining

2 min read

Index of my articles
Index of my articles
Text Mining

2 min read


Published in

Towards AI

·Jul 30

Research Article Meta-data Description Made Quick and Easy

Summarize essential information from research meta-data with text mining methods in a few lines of Python code — By: Petr Koráb (Zeppelin University in Friedrichshafen, Germany; Lentiamo, Prague) and Jarko Fidrmuc (Zeppelin University in Friedrichshafen, Germany) Introduction Research publication meta-data (article titles, publication dates, keywords) provide valuable insights into how the whole field develops over time. Text mining with appropriate techniques helps discover which concepts, theories, or models were…

Text Mining

7 min read

Research Article Meta-data Description Made Quick and Easy
Research Article Meta-data Description Made Quick and Easy
Text Mining

7 min read


Published in

Python in Plain English

·Jun 8

FinVADER: Sentiment Analysis for Financial Applications

FinVADER is a Python library extending the popular VADER sentiment classifier with two financial lexicons. Learn more in this tutorial. — Introduction Sentiment analysis uses various methods to evaluate sentiment from text data. Deep learning (transformer-based specifically) models like Google’s BERT or Meta’s RoBERTa reach classification accuracy far beyond 94 %. The performance of sentiment classifiers strongly depends on the nature of the data. Some fields, like finance and economics, use specific…

Sentiment Analysis

7 min read

FinVADER: Sentiment Analysis for Financial Applications
FinVADER: Sentiment Analysis for Financial Applications
Sentiment Analysis

7 min read


Published in

MLearning.ai

·May 13

Fine-tuning VADER Classifier with Domain-specific Lexicons

Widely used VADER sentiment classifier uses a general-language lexicon to classify language expressed on social media. This article shows how extending the core lexicon with two domain-specific lexicons improves classification accuracy from 58 % to 69 % on Financial PhraseBank data. Introduction VADER (Valence Aware Dictionary and sEntiment Reasoner) classifier is…

Sentiment Analysis

6 min read

Fine-tuning VADER Classifier with Domain-specific Lexicons
Fine-tuning VADER Classifier with Domain-specific Lexicons
Sentiment Analysis

6 min read


Published in

Towards Data Science

·Apr 10

Customer Satisfaction Measurement with N-gram and Sentiment Analysis

Product reviews are an excellent source of information for qualified management decisions. Learn more about the right text mining techniques. — Introduction Happy customers drive company growth. The five-word sentence explains everything about why we do our best to maximize customer satisfaction. Product reviews are one of the major data sources that large companies like Amazon and Apple, middle-sized exporters including Lentiamo, and local companies running their Facebook pages collect. Reviews are…

Text Mining

7 min read

Customer Satisfaction Measurement with N-gram and Sentiment Analysis
Customer Satisfaction Measurement with N-gram and Sentiment Analysis
Text Mining

7 min read


Published in

Towards Data Science

·Mar 6

Sentiment Analysis and Structural Breaks in Time-Series Text Data

Arabica now offers a structural break and sentiment analysis module to enrich time-series text mining — Introduction Text data contains lots of qualitative information, which can be quantified with various methods, including sentiment analysis techniques. These models are used to identify, extract and quantify emotions from text data and have wide use in business and academic research. Since the text is often recorded on a time-series basis…

Text Mining

7 min read

Sentiment Analysis and Structural Breaks in Time-Series Text Data
Sentiment Analysis and Structural Breaks in Time-Series Text Data
Text Mining

7 min read


Published in

Towards Data Science

·Feb 9

Text Data Pre-processing for Time-Series Models

Have you ever thought about how sentiment from text data can be used as a regressor in time-series models? — Introduction Text data offer qualitative information that can be quantified, aggregated, and used as a variable in time-series models. Simple methods of text data representation, such as one-hot encoding of categorical variables and word n-grams, have been used since NLP’s early beginnings. Over time, more complex methods, including the Bag-of-words model…

Text Mining

6 min read

Text Data Pre-processing for Time-Series Models
Text Data Pre-processing for Time-Series Models
Text Mining

6 min read


Published in

Towards Data Science

·Jan 9

Visualization Module in Arabica Speeds Up Text Data Exploration

Arabica now offers unigram, bigram, and trigram word cloud, heatmap, and line chart to further accelerate time-series text data analysis — Introduction Arabica is a python library for exploratory text data analysis focusing on text from a time-series perspective. It reflects the empirical reality that many text datasets are collected as repeated observations over time. Time series text data include newspaper article headlines, research article abstracts and metadata, product reviews, social network…

Text Mining

6 min read

Visualization Module in Arabica Speeds Up Text Data Exploration
Visualization Module in Arabica Speeds Up Text Data Exploration
Text Mining

6 min read


Published in

Python in Plain English

·Nov 26, 2022

Arabica is Now Fully Documented

Arabica now has easy-to-follow documentation, including coding examples for descriptive and time-series text data analysis — Arabica is a Python library for exploratory data analysis specifically designed for time-series text data. It reflects the reality that many text datasets are currently collected as repeated observations over time (product reviews, communication on social media, article headlines, etc.). It makes exploratory analysis of these datasets simple by providing:

Text Mining

2 min read

Arabica is Now Fully Documented
Arabica is Now Fully Documented
Text Mining

2 min read


Published in

Towards Data Science

·Nov 15, 2022

Contour Plots and Word Embedding Visualisation in Python

Contour plots are simple and very useful graphics for word embedding visualization. This end-to-end tutorial uses IMDb data to illustrate coding in Python. — Introduction Text data vectorization is a necessary step in modern Natural Language Processing (NLP). The underlying concept of word embeddings has been popularised by two Word2Vec models developed by Mikolov et al. (2013a) and Mikolov et al. (2013b). …

Text Mining

5 min read

Contour Plots and Word Embedding Visualisation in Python
Contour Plots and Word Embedding Visualisation in Python
Text Mining

5 min read

Petr Korab

Petr Korab

711 Followers

Data scientist and researcher - Text Mining, ML, Data Visualization

Following
  • TDS Editors

    TDS Editors

  • Cassie Kozyrkov

    Cassie Kozyrkov

  • Kamal Dhungana

    Kamal Dhungana

  • Dipika Baad

    Dipika Baad

  • Dr. Veronica Espinoza

    Dr. Veronica Espinoza

See all (40)

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech

Teams