This is my first scraper: a dynamic scraper written in R. The project has two goals: 1) automate the creation of datasets of adversarial attacks, and 2) assign a risk score to each attack automatically. In this script, I focused on the Chainalysis blog, and specifically on the actions of the DPRK-linked Lazarus Group. I scraped the articles by tag and built a table of dates, links, titles, and summaries. I also created a scraping loop. After pulling the data, I selected certain keywords, such as "hack" and "weapons." The idea is not only to search the articles for these keywords, but also to later develop a risk-scoring tool. For example, "weapons" or "nuclear" would correspond to a higher risk score than "scam," so that each attack can be categorized by severity automatically before a human checks it. Coding is not my specialty, so the code is imperfect and the scope is quite limited for now. In the future, I believe I can expand it and customize it into my own tool.
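
To make the risk-scoring idea concrete, here is a minimal sketch in base R. It is not the code from this repository: the weights, the `score_text` helper, and the toy `articles` table are all hypothetical and chosen only for illustration. It assumes each keyword carries a fixed weight and that an article's score is the sum of the weights of the keywords it mentions.

```r
# Hypothetical keyword weights: higher means more severe.
# These values are illustrative assumptions, not taken from the project.
keyword_weights <- c(nuclear = 5, weapons = 5, hack = 3, scam = 1)

# Score one text: sum the weights of every keyword that appears at least once.
# Matching is case-insensitive and anchored at the start of a word, so
# "hack" also matches "hacked" or "hackers".
score_text <- function(text, weights = keyword_weights) {
  text <- tolower(text)
  hits <- vapply(
    names(weights),
    function(k) grepl(paste0("\\b", k), text, perl = TRUE),
    logical(1)
  )
  sum(weights[hits])
}

# Toy stand-in for the scraped table (the real one also has dates and links).
articles <- data.frame(
  title = c("Exchange hack traced to Lazarus Group",
            "Romance scam losses rise",
            "Stolen funds and weapons programs"),
  summary = c("Analysts describe the hack and laundering path.",
              "A scam targeting retail users.",
              "Proceeds may fund nuclear and weapons development."),
  stringsAsFactors = FALSE
)

# Score title and summary together, one score per article.
articles$risk_score <- vapply(
  paste(articles$title, articles$summary),
  score_text,
  numeric(1),
  USE.NAMES = FALSE
)

print(articles[, c("title", "risk_score")])
```

With these assumed weights, the third toy article scores highest because it mentions both "nuclear" and "weapons". Counting each keyword once per article keeps long articles from outscoring short ones simply by repeating a word, though a real tool might weigh frequency or context differently.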
oudeline/chain-scraper