• wise_pancake
    link
    fedilink
    arrow-up
    6
    ·
    edit-2
    2 days ago

    Most recently:

    I had to parse an html table with a column of printed python dictionaries, the dicts wrapped a mixed delimiter list of SQL statements of various types.

    It’s truly an abomination.

    I’ve never had to use pythons AST module to parse data before…

      • wise_pancake
        link
        fedilink
        arrow-up
        2
        ·
        edit-2
        2 days ago

        It was 900mb raw and compressed to 1.6mb after cleaning.

        That’s how much junk data there was.

        Combing through that log is the only way to prove and identify where a big was introduced so I can fix it.