Skip to content

heath3rq/NLP_Company-Industry-Classification

 
 

Repository files navigation

IDS 703 Final Project: Assessing the Effectiveness of Linkedin Company Description in Classifying Industry

By Ya-Yun Huang, Emma Wang, Heather Qiu

Project Description

The primary objective of the project is to evaluate whether company descriptions on LinkedIn are sufficient to identify the type of business. As a prototype, we gathered organizational information from LinkedIn, including company names, profiles, and industries. The current study focuses on three predominant industries: Financial Services, Hospitals & Health Care, and IT Services & IT Consulting. However, our analysis can be scaled to incorporate additional business sectors for broader use cases. For instance, the language models employed in the research provide a good foundation for data applications that seek to generate sample company overviews and/or evaluate the quality of the generated company descriptions. The graph below visualizes popular keywords of LinkedIn organization profiles. The larger the size of the word, the more frequently it appears in the dataset.

output

Project Files

Installation Requirements

Please execute the following commands to install packages necessary to run LinkedInCompanyClassification.ipynb: pip install --upgrade pip && pip install -r requirements.txt

About

The primary objective of the project is to evaluate whether company descriptions on LinkedIn are sufficient to identify the type of business.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 100.0%