Skip to content
This repository has been archived by the owner on Feb 19, 2021. It is now read-only.

Automatic detection of document type #292

Closed
BastianPoe opened this issue Jan 30, 2018 · 2 comments
Closed

Automatic detection of document type #292

BastianPoe opened this issue Jan 30, 2018 · 2 comments
Labels

Comments

@BastianPoe
Copy link
Contributor

Hi!

I have many documents that I want to store digitally and have found that searching for documents in the pile is a lot easier when knowing the type (e.g. invoice, credit card statement, etc.) of the stored documents. Since a couple of days I am thinking about an automated way to determine the type of a document (or even better the subject of it).

Currently, I do see the following options:

  • use tags
  • add a separate field for this (similar to correspondent)
  • find a magic library / webservice

What do you think? What are your needs?

Bastian

@danielquinn
Copy link
Collaborator

Depending on what you want to use to determine the type of a document, the tags system may be all you need. Typically I use a regex or even a literal match on my tags and that's served me well so far. Documents that contain the term NHS are automatically tagged health, while anything from HMRC it tagged taxes etc.

Do you have a different need/idea?

@danielquinn
Copy link
Collaborator

Closing this one to try to clean up the queue a bit. If you have some ideas you'd like to try out, feel free to re-open.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests

2 participants