I have an unRAID server running a bunch of Docker containers, and yet I’m still scanning and filing my documents in an SMB share like a goon!

What options are out there for me? I’m after something that has the following features:

- Scan to email functionality for ingest as well as manual ingest from another digital file share

- OCR

- Tagging

I’m honestly not sure what else

Suggestions?

  • mpopgun@alien.topB
    1 year ago

    Look at Nextcloud. It can be mapped onto an existing file structure, and it has plenty of plugins for file management, file routing, and OCR.

    Stirling PDF might be a useful tool as well.

  • sumistev@alien.topB
    1 year ago

    I’m drinking the paperless-ngx koolaid very hard. Have digitized over 1k documents into it so far. Fast and easy to use.

    • stphn17@alien.topB
      1 year ago

      The only issue I see with paperless-ngx is that you cannot use an existing folder structure, or has that changed in the meantime?

      I would like to access the documents via paperless-ngx but also preserve and keep using my existing folder structure, especially to make retrieval easier for someone other than me in an emergency, or if I cannot use paperless-ngx for whatever reason.

      In my experience, a clearly defined, logical folder structure is easier to follow for someone who hasn’t spent hours creating it themselves or doesn’t know about paperless-ngx.

      • sumistev@alien.topB
        1 year ago

        There are “storage partitions” (if I’m remembering the wording correctly) that let you put documents into physical storage locations, but there’s not a formal folder structure. For me I constantly found myself needing documents in two places (eg: property tax bill in both the folder for my house as well as my annual income tax filing, since I want all the documents for that together too). Formal folder structure was too limiting for me. Having things tagged just works better for me and eliminated my problem of having to commit to a folder structure that I wouldn’t like next year.
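To make the "one document, two places" point concrete, here is a minimal sketch of tag-based lookup. It is not how paperless-ngx is implemented; the file names and tags are made up for illustration.

```python
from collections import defaultdict

# Hypothetical documents and tags, invented for this example.
documents = {
    "property-tax-bill-2023.pdf": {"house", "income-tax-2023"},
    "roof-repair-invoice.pdf": {"house"},
    "w2-2023.pdf": {"income-tax-2023"},
}


def build_tag_index(docs):
    """Map each tag to the sorted list of documents that carry it."""
    index = defaultdict(list)
    for name, tags in docs.items():
        for tag in tags:
            index[tag].append(name)
    return {tag: sorted(names) for tag, names in index.items()}


tag_index = build_tag_index(documents)

# The same bill shows up under both tags without being copied anywhere.
print(tag_index["house"])
print(tag_index["income-tax-2023"])
```

With folders, the property tax bill would have to live in one directory or be duplicated; with tags it is stored once and found from either side.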

        • stphn17@alien.topB
          1 year ago

          Absolutely agree, and that’s where paperless-ngx will shine. But for my documents I prefer a tool-agnostic (and therefore future-proof) way of storing them. When a document could go in multiple places, I always ask myself, “What’s the most likely way I will be looking for this document in the future?”

      • marmata75@alien.topB
        1 year ago

        While it doesn’t use a fixed folder structure, you can decide the folder structure for each file based on any attribute. So all 2023 receipts for car A can go to “car a/receipts/2023” or “receipts/2023/car a” or whatever you wish. Really flexible! And if you change your mind, you change the scheme and all the files are moved to where they belong!
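A rough sketch of the idea of attribute-based paths, using plain Python string templates. This is only an illustration: the `Document` fields and placeholder names below are hypothetical and are not paperless-ngx's actual placeholder set, which is described in its own documentation.

```python
from dataclasses import asdict, dataclass


@dataclass
class Document:
    # Hypothetical attributes, chosen only for this example.
    title: str
    document_type: str
    subject: str
    created_year: int


def render_path(template: str, doc: Document) -> str:
    """Fill a path template with the document's attributes."""
    return template.format(**asdict(doc))


doc = Document(
    title="oil-change",
    document_type="receipts",
    subject="car a",
    created_year=2023,
)

# Two different schemes applied to the same document.
by_subject = render_path("{subject}/{document_type}/{created_year}/{title}.pdf", doc)
by_type = render_path("{document_type}/{created_year}/{subject}/{title}.pdf", doc)

print(by_subject)  # car a/receipts/2023/oil-change.pdf
print(by_type)     # receipts/2023/car a/oil-change.pdf
```

Because the path is derived from the attributes, switching schemes means re-rendering every path from the new template rather than re-sorting files by hand.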

    • t3abagger@alien.topB
      1 year ago

      With the last upgrade I can’t scan in any docs and I get errors:

      documents.parsers.ParseError: SubprocessOutputError: Ghostscript rasterizing failed. See logs for more information.

      You aren’t having that? I did some searching, and none of the workarounds are working… around the issue.

      • fedroxx@alien.topB
        1 year ago

        I was having that error, and it was caused by my compose configuration: I had /tmp incorrectly mapped. I removed the mapping entirely and it started working again.
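If you want to check your own setup for the same kind of mapping, here is a small sketch that flags volume entries targeting /tmp inside the container. It assumes the short `host:container[:mode]` volume syntax with Unix-style paths, and the example paths are hypothetical; it only finds candidate entries and says nothing about whether a given mapping is actually the cause of the error.

```python
def find_tmp_mappings(volumes):
    """Return volume entries whose container-side path is /tmp or below it.

    Assumes short-syntax entries ("host:container" or "host:container:mode")
    with Unix-style paths; a bare entry like "/tmp" is treated as a
    container path.
    """
    flagged = []
    for entry in volumes:
        parts = entry.split(":")
        container_path = parts[1] if len(parts) >= 2 else parts[0]
        container_path = container_path.rstrip("/") or "/"
        if container_path == "/tmp" or container_path.startswith("/tmp/"):
            flagged.append(entry)
    return flagged


# Hypothetical volume list, as it might be copied out of a compose file.
volumes = [
    "/mnt/user/appdata/paperless/data:/usr/src/paperless/data",
    "/mnt/user/appdata/paperless/tmp:/tmp",
    "/mnt/user/scans:/usr/src/paperless/consume:rw",
]

print(find_tmp_mappings(volumes))
```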

      • sumistev@alien.topB
        1 year ago

        I am on version 1.17.4, not having any issues scanning in documents still. Loaded a few more in today.

  • MoistTowelettes1@alien.topB
    1 year ago

    Paperless-NGX is the way to go.

    Bonus points if you’re on iPhone because QuickScan recently added Paperless-NGX support so you can quickly scan and upload documents without a hassle.

  • KeyWinter4959@alien.topB
    1 year ago

    You can use Nextcloud with apps. I once had to deal with PDF files that had no text layer, and I now have everything set up to automatically recognize the text and apply a text layer to PDF files.
    Nextcloud has many features that are worth a closer look: file synchronization, mail, instant messaging, and much more.

    This section will be useful:
    - Flow
      - Flow configuration
    - Files access control
      - Denied access
      - Examples
        - Denying access to folders
        - Prevent uploading of specific files
      - Common misconfigurations
      - Available rules
    - Automated tagging of files
      - Assigning restricted and invisible tags
      - Example
      - Available rules
      - Executing actions
    - Retention of files
      - Example
      - Common misconfigurations