Troubles parsing messenger chats #65

@Goretx

Hello everyone!

I'll start by saying that I'm very inexperienced with... well... everything.

While parsing from WhatsApp works smoothly, I'm having some trouble with Messenger chats.

This is the output I'm getting:

2020-04-25 23:31:17,481 [INFO ] [parsers.mess]: Parsing Facebook messenger data...
Traceback (most recent call last):
  File "parse.py", line 84, in <module>
    ArgParse()
  File "parse.py", line 42, in __init__
    getattr(self, args.command)()
  File "parse.py", line 69, in messenger
    main(args.own_name, args.file_path, args.max)
  File "/Users/*****/Chatistics/parsers/messenger.py", line 22, in main
    data = parse_messages(file_path, own_name)
  File "/Users/*****/Chatistics/parsers/messenger.py", line 64, in parse_messages
    content = fix_text_encoding(content)
  File "/Users/*****/Chatistics/parsers/messenger.py", line 79, in fix_text_encoding
    return text.encode('latin1').decode('utf8')
UnicodeEncodeError: 'latin-1' codec can't encode character '\u201c' in position 17: ordinal not in range(256)
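
For context, the traceback points at the `text.encode('latin1').decode('utf8')` round trip, which fails as soon as the text contains a character outside Latin-1 (here `\u201c`, a left curly quote). Below is a minimal sketch of a more tolerant variant. The name `fix_text_encoding_safe` is hypothetical and this is not necessarily how the project would fix it; it simply assumes that text which cannot be round-tripped is already correctly decoded and can be returned unchanged.

```python
def fix_text_encoding_safe(text: str) -> str:
    """Undo latin-1/UTF-8 mojibake where possible, otherwise return the text as-is.

    Hypothetical helper for illustration; it mirrors the failing line from the
    traceback but catches the errors instead of letting them propagate.
    """
    try:
        # Same round trip as in the traceback: reinterpret the characters as
        # latin-1 bytes, then decode those bytes as UTF-8.
        return text.encode('latin1').decode('utf8')
    except (UnicodeEncodeError, UnicodeDecodeError):
        # UnicodeEncodeError: the text has characters outside latin-1
        # (e.g. '\u201c'), so it cannot be mojibake of this kind.
        # UnicodeDecodeError: the latin-1 bytes are not valid UTF-8.
        # Assumption: in both cases the text is already fine.
        return text


# Mojibake for "café" is repaired; a curly-quoted string passes through unchanged.
repaired = fix_text_encoding_safe('caf\u00c3\u00a9')
untouched = fix_text_encoding_safe('\u201chello\u201d')
```

The trade-off of this approach is that a message mixing mojibake with genuine non-Latin-1 characters would be left unrepaired rather than crashing the whole parse.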

Running export afterwards then prints:

...
2020-04-26 00:43:32,913 [INFO ] [utils       ]: Could not find any data for platform messenger

In /raw_data/messenger I have all the conversation folders that were contained in the inbox folder of the original archive downloaded from Facebook, like this: /raw_data/messenger/username/message_1.json.
(I think the documentation should be updated; it makes it look like the inbox folder needs to be there too.)
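
To double-check the folder layout described above, a quick sketch like the following lists the message files one level below the messenger folder. The function name `find_message_files` is hypothetical, and the `<conversation>/message_N.json` pattern is only an assumption based on the path given above, not something taken from the Chatistics code.

```python
from pathlib import Path


def find_message_files(root):
    """Return all message_*.json files exactly one folder below `root`.

    Assumes the layout root/<conversation>/message_N.json described above.
    """
    return sorted(Path(root).glob('*/message_*.json'))


# Example: count the files found under the folder used in this report.
files = find_message_files('raw_data/messenger')
print(f'Found {len(files)} message file(s)')
```

If this finds nothing, the files are probably nested one level deeper (for example still inside an inbox folder) or the path is relative to a different working directory.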

I'm working in JupyterLab.

Thanks in advance!
