Paperwork is a personal document manager for scanned documents (and PDFs).
It's designed to be easy and fast to use. The idea behind Paperwork is "scan & forget": You should be able to just scan a new document and forget about it until the day you need it again.
In other words, let the machine do most of the work for you.
- Scan
- Automatic detection of page orientation
- OCR
- Document labels
- Automatic guessing of the labels to apply on new documents
- Search
- Keyword suggestions
- Quick edit of scans
- PDF support
Note regarding updates: If you're upgrading from a previous version installed with pip, it is strongly recommended you uninstall it first before installing the new version.
- GNU/Linux Archlinux
- GNU/Linux Debian
- GNU/Linux Fedora
- GNU/Linux Gentoo
- GNU/Linux Ubuntu
- Docker
- GNU/Linux Development
- Windows: Installation / Development
If you used the installer from OpenPaper, Paperwork can be uninstalled like any other Windows application (something like Control Panel --> Applications --> Uninstall).
If you installed it manually (for development), you can follow the same process than for GNU/Linux
- Extra documentation / FAQ / Tips / Wiki
- Mailing-list (be careful: By default, Google groups set your subscription to "no email")
- Bug trackers
Papers are organized into documents. Each document contains pages.
It mainly uses:
- Sane/Pyinsane: To scan the pages
- Tesseract/Pyocr: To extract the words from the pages (OCR)
- GTK: For the user interface
- Whoosh: To index and search documents, and provide keyword suggestions
- Simplebayes: To guess the labels
- Pillow/Libpillowfight: Image manipulation
Paperwork has automated tests. These tests are designed in a unusual manner: They use screenshots (aka "May work or may not work. Do you feel lucky today ?"). Because these screenshots change often and are heavy (compared to source files), they are stored in a separate Git repository: paperwork-tests
GPLv3 or later. See COPYING.
All the information can be found on the wiki