Type for search...
codete case study dmt main 5521292933
Codete Blog

Case Study: Content Monitoring Tool

codete logo 41a83d4d26

05/11/2021 |

2 min read

Codete

We partnered with a well-known tech university from the United States to build a fully automated, NLP-powered social media monitoring tool for sentiment analysis.

Codete's researchers are involved in both internal and external projects. We cooperate with leading technology universities in the world in the area of Research & Development (R&D). Our work is mostly focused on (but not limited to) data science, image and video processing, as well as NLP. 

In this project, we used NLP to create a tool for monitoring social media content and analyzing sentiment in the posts.

codete case study dmt 1 e8c5f205f8

 

Challenge: Real-time sentiment analysis with machine learning

The growth of social media usage changed the world of marketing. It has never been easier to deliver content than it is now. Any information can spread across the globe in less than a second, so it is critical to recognize what is happening with your product or brand as soon as possible. 

It is especially important if something goes wrong or if you want to see how other people perceive what you do. That is why we believe that having continuous and fully automated social media monitoring tool is a must-have for all businesses, and sentiment is one of the most important factors to monitor.

 

Solution: Custom NLP solution for Twitter

Our partner's CoreNLP is a cutting-edge NLP library that we intended to use for our content monitoring tool, but its accuracy was only about 50%, which fell short of our expectations. The tool was intended to work with longer, usually grammatically correct texts, but because Twitter limits message length and the platform's users don't care that much about linguistic correctness, the library's ability to work properly with the texts we had was unsatisfactory. 

We created our own solution by combining the TFIDF vectorization method, PCA, and Random Forest classifiers. We collected publicly available datasets of tweets labeled with their sentiment for the training phase. As a result, our method achieved over than 75% accuracy.

 

Tech stack

codete case study dmt tech stack 84b06c3c44
  • Python
  • Scikit-learn

 

Rated: 5.0 / 1 opinions
codete logo 41a83d4d26

Codete

IT consulting and software development company. Since 2010, we’ve been supporting businesses worldwide in gaining competitive advantage by means of modern technology. We advise on digitalization, develop and implement high-quality solutions, and augment our clients’ teams with skilled software developers.

Our mission is to accelerate your growth through technology

Contact us

Codete Przystalski Olechowski Śmiałek
Spółka Komandytowa

Na Zjeździe 11
30-527 Kraków

NIP (VAT-ID): PL6762460401
REGON: 122745429
KRS: 0000696869

Offices
  • Kraków

    Na Zjeździe 11
    30-527 Kraków

  • Lublin

    Wojciechowska 7E
    20-704 Lublin

  • Berlin

    Wattstraße 11
    13355 Berlin

Copyright 2022 Codete