An LLM-powered web scraper that has the DOM elements and their styling in its context could potentially scrape content hierarchically, in a way that other tools can't easily replicate, especially in a headless environment.
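One way to picture "DOM awareness in context" is to flatten the page into a compact, indented outline of tags, ids, classes, and inline styles that could be placed in a prompt. The following is only a minimal sketch using Python's standard `html.parser`; the names `DomOutline` and `dom_outline` are hypothetical, the 60-character text cutoff is an arbitrary assumption, and a real headless setup would likely read computed styles from the browser rather than inline `style` attributes.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not increase the depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class DomOutline(HTMLParser):
    """Collect one outline line per element or text node, indented by depth."""

    def __init__(self):
        super().__init__()
        self.depth = 0
        self.lines = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        label = tag
        if attrs.get("id"):
            label += "#" + attrs["id"]
        if attrs.get("class"):
            label += "." + ".".join(attrs["class"].split())
        if attrs.get("style"):
            label += ' [style="' + attrs["style"] + '"]'
        self.lines.append("  " * self.depth + label)
        if tag not in VOID_TAGS:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        # Collapse whitespace and truncate long text (assumed limit: 60 chars).
        text = " ".join(data.split())
        if text:
            self.lines.append("  " * self.depth + '"' + text[:60] + '"')


def dom_outline(html: str) -> str:
    """Return the indented outline of an HTML string."""
    parser = DomOutline()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.lines)
```

Calling `dom_outline(page_html)` yields a text outline whose indentation mirrors element nesting, which is the kind of structural hint the scraper could pass to the model alongside its extraction instructions.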
Example use cases: Hacker News comment extraction, and extracting or filtering textual data from the main hero content of varying sources.
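For the comment-extraction case, the hierarchical part could amount to rebuilding a reply tree from what the page shows. The sketch below assumes the thread has already been reduced to a flat sequence of `(indent_level, text)` rows in page order, which appears to match how Hacker News lays out threads but should be verified against the live markup. `Comment` and `build_tree` are hypothetical names, and the step that produces the rows (the LLM or a selector) is out of scope here.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Comment:
    text: str
    replies: list[Comment] = field(default_factory=list)


def build_tree(rows):
    """Turn (indent_level, text) rows in page order into nested comments."""
    roots = []
    stack = []  # stack[i] is the most recent comment seen at depth i
    for indent, text in rows:
        node = Comment(text)
        # Clamp so a row is at most one level deeper than the previous row.
        indent = max(0, min(indent, len(stack)))
        del stack[indent:]  # drop ancestors that this row is not nested under
        if stack:
            stack[-1].replies.append(node)
        else:
            roots.append(node)
        stack.append(node)
    return roots
```

A stack keyed by depth keeps this a single pass over the rows; the clamp is a defensive choice so that a skipped or misread indent level attaches the comment to the nearest known parent instead of raising an error.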