Skip to content

creating scraper #1

@erikao

Description

@erikao

These are the steps I think make sense in working with this scraper. The scraper this code is based off of assumes it is cycling through paragraph tags within a div, not classes of separate divs.
-1st figure out how to get it to parse separate divs
-2nd figure out how to get it to search within a div for a string, ie the test page has a lengthy class with many tags in it, including "post." Need to be able to compare just "post," not the full class name.
-3rd would be helpful to dump the tags separately, they include useful topical info

Python assistance appreciated!

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions