Skip to content

Add section using scrapyloganalyzer #80

@jpmckinney

Description

@jpmckinney

Similar to the logic in the data registry.

https://github.com/open-contracting/data-registry/blob/23767b735e68e52a7705b1c0ea390aa60b15149c/data_registry/process_manager/task/collect.py#L124-L145

We can also add https://scrapy-log-analyzer.readthedocs.io/en/latest/api/index.html#scrapyloganalyzer.ScrapyLogFile.is_complete to indicate whether the crawl was a subset (sample, etc.).

cc @yolile

Here is the commit from an older PR: b970412


scrapy-log-analyzer's logparser dependency is GPL. Might need to modify license for relevant notebooks.

https://ocp-software-handbook.readthedocs.io/en/latest/python/preferences.html#license-compliance

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions