@berdandy
2/2
You should consider a 'document management system' (#dms) for much more than 2k titles. Just know that these are almost universally server type systems with web GUIs. You'll have to learn a bit of #selfhosting but that's no bad thing.

Some good options here:
https://awesome-selfhosted.net/tags/document-management.html

I use Mayan EDMS to manage a growing ~80k assorted document titles, including rpg.

Paperless-ngx should meet your, presumably, more modest needs and is easier to set up.

#documentmanagementsoftware

Document Management - awesome-selfhosted

A list of Free Software network services and web applications which can be hosted on your own servers

Tutorial: Paperless-NGX in a LXC-Container on Proxmox - Part 1

Important notes:

  • I am not a leading expert on any of the topics covered here. This is surely not the the "ultimate guide". If you stumble over things I could have done better, I gladly listen to your advice.
  • This tutorial will come in multiple parts. I will link them together in the end and put them in a blog post.

Background

Paperless NGX ist an Open Source Software that allows document management. It allows you to organize your documents in an efficient way. You can find the software and a lot of documentation here: https://docs.paperless-ngx.com/

LXC is a system-container technology. It is an attempt to provide something more lightweight than a virtual machines but that still feels like one . They are intended that you login to them and manage them mostly like a Linux virtual machine. You can find a more in depth explanation here: https://linuxcontainers.org/

Proxmox VE is a virtualization environment like VMware. It allows you to run full fledged virtual machines and LXC containers. It has become my virtualization platform of choice. There is a free and a comercial version available here: https://www.proxmox.com/en/products/proxmox-virtual-environment/overview

Why Paperless-NGX?

In 2009 we had to take over the paperwork for elderly relatives they could no longer handle themselves. To make things even more complicated, my wife and I were living in different towns during the week due to work reasons. Also we had recently bought a house which also significantly increased joint paperwork. In order to work jointly on documents, we needed to have them in a digital format.

So I bought a scanner. Better said: I bought multiple scanners until I settled for a Brother ADS-1600W.

I don't want to go into too much depth, but there are two points really important in a scanner:

  • The scanner can handle stacks of papers with reasonable reliability
  • The scanner can work without any PC and deliver documents directly into a directory
  • Luckily I am experienced with processes, so the best decision I made was to establish a process my wife and I both adhered to:

    • Any document that arrrived in paper first received a serial number. For this I bought a pagination stamp which creates an automatic serial number on each stamping process. This was one of my best purchasing decisions.
    • Then the document was scanned.
    • When the document was scanned it was stamped as "Scanned".
    • The paper document was archived in folders ordered by serial number. That allows fast retrieval of any original copy.
    • The scanned file was named according to a scheme that included Serial#, Recipient, Sender, Purpose, Date and "tags" like "relevant for taxes" or "healthcare". After naming it was sorted into one or more directories.

    Document retrieval was looking for file names. And this was were the process was now breaking after 16 years. We now had about 5.000 documents stashed there and it became more and more complicated to retrieve them. We needed full text search that works across multiple platforms (Windows, MacOS, IOS).

    The second reason to start with Paperless-NGX was that more and more document arrived in a digital fashion (download, email). The workflow was rudimentarily adjusted by having an "empty" serial# for such documents. I didn't want to print them, serialize them and the scan them again. That would have felt stupid. But this would not have been a good solution.

    The third reason was the amount of manual effort involved. Naming the files felt bothersome. I put it off for month and then hundreds of documents were waiting. Usually tax season was the ultimate deadline. Also in manual processes, you make a lot of mistakes, mostly spelling the name of the correspondent wrong or hitting a wrong number. There was no automation assisting me.

    I thought of using AI to partially automate this process (naming and retrieval). But then I stumbled over Paperless-NGX and had the impression that it would solve most of my problems "out of the box".

    Design choices

    The standard way to run Paperless-NGX is to use a docker container. I am using Proxmox VE and that solution did not support running docker containers directly. There were two options to address this:

  • Install a Linux VM, run Docker (or similar) inside that VM and then install Paperless-NGX as container
  • Install a LXC Container and get Paperless-NGX working inside that container
  • I decided for option 2 as it eliminated a technology layer completely. Docker is not extremely complex, but in my experience adding a technology layer to make your life simpler always backfires at some point. Furthermore using Docker makes you more prone to not understanding how the application really works. In case of debugging or updates this can carry a hefty price tag.

    This has one huge advantage to the reader of this tutorial. You can take 95% of this tutorial to install Paperless-NGX on a virtual machine or even a bare metal hardware.

    #paperless #paperlessngx #documentmanagementsoftware

    Home - Paperless-ngx

    Documentation for the Paperless-ngx document management system software.

    AI in Document Management: Automate & Boost Efficiency

    Artificial Intelligence in Document Management:

    Discus Greenbox - Advanced Document Management Solution

    Centralize, automate, and secure document management with Discus Greenbox DMS. Perfect for regulated industries seeking compliance.

    Discover how robust security features like encryption, access controls, and audit trails can safeguard your data, giving you peace of mind and confidence in your document management system.

    πŸ” Learn more: https://zurl.co/4U6iv

    #discusit #Greenbox #ITSolutions #OrganizeContent #DocumentManagementSystem #GoPaperless #CloudStorage #DataSecurity #DMSSoftware #documentmanagementsoftware #filemanagementsoftware #DocumentController

    Ensuring Data Security: The Vital Role of Security Controls in Document Management Software

    Electronic Document Management System Is A Necessity in 2020

    Electronic document management system is something that everyone is in need of when a global pandemic is in effect if they want to make to sustain.

    Unified Infotech