oudeline/chain-scraper

chain-scraper

This is my first scraper. It is written in R and is dynamic. The project has two goals: 1) automate the creation of datasets of adversarial attacks, and 2) assign a risk score to each attack automatically.

In this script, I focused on the Chainalysis blog, specifically its articles on the actions of the DPRK-linked Lazarus Group. I scraped the articles by tag and built a table of dates, links, titles, and summaries. I also created a scraping loop.

After pulling the data, I selected certain keywords, such as "hack" and "weapons." The idea is not only to search the articles for them but also, later, to develop a risk-scoring tool. For example, "weapons" or "nuclear" would correspond to a higher risk score than "scam," so that each attack can be categorized by severity automatically before a human checks it.

Coding is not my specialty, so the code is imperfect and the scope is quite limited for now. In the future, I believe I can expand it and customize it as my own tool.
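The keyword-based risk scoring described above could look roughly like the sketch below. It is an illustration only, not the code in this repository: the weights, the `score_text` helper, and the example rows are all assumptions made up for the example, and it uses base R only.

```r
# Hypothetical keyword weights (assumed values): higher means more severe.
keyword_weights <- c(nuclear = 5, weapons = 5, hack = 3, scam = 1)

# Score one text by summing the weights of every keyword found in it.
# Matching is case-insensitive and substring-based, so "hack" also
# matches "hacked" or "hackers"; each keyword counts at most once.
score_text <- function(text, weights = keyword_weights) {
  found <- vapply(
    names(weights),
    function(kw) grepl(kw, text, ignore.case = TRUE),
    logical(1)
  )
  sum(weights[found])
}

# Invented rows standing in for the scraped table of titles and summaries.
articles <- data.frame(
  title = c("Example A", "Example B", "Example C"),
  summary = c(
    "Stolen funds from the hack may finance a weapons programme.",
    "A phishing scam targeted exchange users.",
    "Quarterly market overview."
  ),
  stringsAsFactors = FALSE
)

# Add a risk_score column so the highest-risk articles can be reviewed first.
articles$risk_score <- vapply(
  articles$summary, score_text, numeric(1), USE.NAMES = FALSE
)
print(articles[, c("title", "risk_score")])
```

With these assumed weights, an article mentioning both "hack" and "weapons" scores higher than one mentioning only "scam," and an article with no keywords scores zero. A human reviewer would still check the result, as the scoring is only a first-pass triage.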

About

Chainalysis blog scraper. It is dynamic and can be used for any tag; I focused on the blog's articles about the Lazarus Group.
