• redw04
    link
    fedilink
    arrow-up
    3
    arrow-down
    8
    ·
    9 months ago

    So uh…as someone who’s currently trying to scrape the web for email addresses to add to my potential client list … where do I start researching this?

    • lutillian@sh.itjust.works
      link
      fedilink
      arrow-up
      4
      ·
      9 months ago

      Start looking into selenium, probably in Python. It’s one of the easier to understand forms of scraping. It’s mainly used to web testing, though you can definitely use it for less… nice purposes.

    • PieMePlenty@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      edit-2
      9 months ago

      Step one will be learning to code in any language. Step two is using a library to help with it. HtmlAgilityPack has always been there for me. Don’t use regex.

      • Rodeo
        link
        fedilink
        arrow-up
        2
        ·
        9 months ago

        Virgin library user vs. Chad regex dev