Skip to content

Dilbert, Grimmy, Kevin and Kell not downloading #109

@eikelan

Description

@eikelan

1st of all: Thank you for dosage!
I'm using dosage-2.15-1 from Debian testing (Stretch)
Most of the comics I'm interested in are downloading fine
with 3 exceptions:
Dilbert, Mother Goose and Grimm, Kevin and Kell
I'm adding the error messages below.
No, it is not a matter of life and death but would be nice having ;-)

Thank you for your time.
Eike, Paraguay

Dilbert
where I used to look:
http://dilbert.com/strip/2015-07-26

where dosage looks:
http://www.arcamax.com/thefunnies/dilbert/s-1693329
the cartoon is there alright

~$ dosage Dilbert:2015-07-21
Dilbert> ERROR: Patterns ['<\s_[aA]\s+(?:[^>]\s+)?[hH][rR][eE][fF]\s=\s_"(/\d+-\d+-\d+/)"[^>]STR_Prev[^>]>'] not found at URL http://dilbert.com/.


Mother Goose and Grimm

where I used to look:
http://www.grimmy.com/images/MGG_Archive/MGG_2015/MGG-2015-07-26.gif
where dosage looks
http://www.arcamax.com/thefunnies/mothergooseandgrimm/s-1693318
the cartoon is there alright

~$ dosage MotherGooseAndGrimm:2015-07-21
Arcamax/MotherGooseAndGrimm> Retrieving 1 strip for index 2015-07-21
Arcamax/MotherGooseAndGrimm> ERROR: Patterns ['<\s_[aA]\s+(?:[^>]\s+)?[hH][rR][eE][fF]\s=\s_"(/newspics/[^"]+)"[^>]zoom[^>]>'] not found at URL http://www.arcamax.com/thefunnies/mothergooseandgrimm/2015-07-21.


KevinAndKell

//http://www.kevinandkell.com/2015/kk0725.html
is the comic url, but
dosage is looking for:
//http://www.kevinandkell.com/2015/kk725.html and fails

I don't know how to get the leading 0 if the month is 1..9
I tried the following to test, and it works but will stop working 1st of october obviously
I'm not proficient enough to program in Python2.7

class KevinAndKell(_BasicScraper):
description = u'Kevin & Kell by Bill Holbrook'
url = 'http://www.kevinandkell.com/'
stripUrl = url + '%s/kk0%s%s.html'
firstStripUrl = stripUrl % ('1995', '09', '03')
imageSearch = compile(r'<img.+?src="(/?(\d+/)?strips/kk\d+.(gif|jpg))"', IGNORECASE)
prevSearch = compile(r'<a.+?href="(/?(../)?\d+/kk\d+.html)"[^>]*>Previous Strip', IGNORECASE)
help = 'Index format: yyyy-mm-dd'

def getIndexStripUrl(self, index):
    return self.stripUrl % tuple(map(int, index.split('-')))

stripUrl = url + '%s/kk0%s%s.html'
does not work for month 10..12

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions