OCR to clipboard hook for selections #702
OCR would be great honestly! |
There is a powerful open-source OCR engine called Tesseract (https://github.com/tesseract-ocr/tesseract), whose development was long sponsored by Google. It would be super cool if the maintainers could integrate Tesseract into Flameshot as the author suggested. That way there would be no need for external API connections such as Amazon's, because Tesseract works offline. |
There's a project called dpscreenocr that does the job, yet the selection is not as convenient and user-friendly as flameshot's. |
I implemented that feature and created a pull request: #1239 |
@enoy19 after fixing your typo/copy-and-paste error in 8432d12, this compiles on Manjaro and the OCR works really well, thanks! edit: you have to change the header to |
https://news.ycombinator.com/item?id=27242392 discusses the topic in a macOS context |
Anything new around this feature request? It would be super cool to me tho. Oftentimes I'm working with text copying and this |
@Pheggas If you are comfortable with compiling it yourself, check my comment above. I've been using this feature since then and it works great. |
I am okay to merge this feature if we can:
|
I'd like to. But it would be my first compilation. I'll try to look at it. In case of failure, I'll write you 🙂 |
I am anxiously waiting for this feature! |
For anybody waiting for this functionality, I found this blog post: If you install tesseract, it is basically a one-liner: flameshot gui --raw | tesseract stdin stdout | xclip -in -selection clipboard |
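A minimal sketch of how the one-liner above could be wrapped so that an empty OCR result does not overwrite the clipboard. It assumes an aborted or unreadable capture produces no text on stdout; copy_if_nonempty and ocr_to_clipboard are hypothetical helper names, not part of Flameshot or Tesseract.

```shell
# Hypothetical helper: read text on stdin and hand it to the given
# clipboard command only when it is non-empty.
copy_if_nonempty() {
    local text
    text="$(cat)"                      # note: trailing newlines are dropped here
    if [ -z "$text" ]; then
        echo "no text recognized" >&2
        return 1
    fi
    printf '%s' "$text" | "$@"
}

# Hypothetical wrapper around the one-liner from the comment above (X11, xclip).
ocr_to_clipboard() {
    flameshot gui --raw \
        | tesseract stdin stdout \
        | copy_if_nonempty xclip -in -selection clipboard
}
```

Calling ocr_to_clipboard from a keyboard shortcut would then behave like the one-liner, except that the clipboard is left alone when nothing was recognized.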
It's an amazing solution! Thanks! |
For those using Wayland, use this command instead:
It requires wl-clipboard, together with tesseract and the respective Tesseract language data package. For me, on Arch Linux, the packages are: On Arch Linux it can be installed with: |
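A minimal sketch of a Wayland-aware variant, assuming wl-copy (from wl-clipboard) takes the place of xclip whenever WAYLAND_DISPLAY is set. This is not necessarily the exact command from the comment above; pick_clipboard_cmd and ocr_to_clipboard are hypothetical names.

```shell
# Hypothetical helper: print the clipboard command that fits the session type.
# Assumption: a non-empty WAYLAND_DISPLAY means a Wayland session.
pick_clipboard_cmd() {
    if [ -n "${WAYLAND_DISPLAY:-}" ]; then
        echo "wl-copy"
    else
        echo "xclip -in -selection clipboard"
    fi
}

# Hypothetical wrapper: same pipeline as the X11 one-liner, different sink.
ocr_to_clipboard() {
    # The unquoted expansion is intentional so the command splits into words.
    flameshot gui --raw | tesseract stdin stdout | $(pick_clipboard_cmd)
}
```

The same script can then be bound to a shortcut on both X11 and Wayland desktops.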
Last year, I spent a few hours adapting @enoy19's pull request #1239 to the newer codebase but rm -rf'ed the directory by accident. Yesterday I did it again. You can find my fork here: xeaon/flameshot. After taking a screenshot, you can choose the OCR tool to copy the tesseract output to your clipboard. If successful, there is a notification. As of now, you need to install the dependencies tesseract-ocr and leptonica manually. It won't build in CI, but I'm trying to fix it for Linux builds. As mentioned before, I'm no C++/Qt dev, so this is pretty challenging. Feel free to contribute to my fork if you have the skills. This feature can be merged, if we can
as @borgmanJeremy mentioned earlier. |
This comment was marked as off-topic.
@pitfiend Thanks for your comment, but I am going to hide it. Here are a few points to clarify my reasoning:
|
This is my version in Ubuntu using custom shortcuts. I've created a shell script named flameshot gui --raw | tesseract stdin stdout | tr -d '\n' | xclip -in -selection clipboard The addition of To utilize this script in Ubuntu, you can set up a custom keyboard shortcut as follows:
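A small sketch of what the tr -d '\n' stage in the pipeline above does to multi-line OCR output, next to a space-joining alternative. join_lines and join_lines_spaced are hypothetical names, and the alternative is an assumption about what some users may prefer, not something the comment proposes.

```shell
# What the pipeline above does: delete every newline, so lines are fused
# together with nothing between them.
join_lines() {
    tr -d '\n'
}

# Alternative (assumption): turn newlines into spaces so the last word of one
# line does not run into the first word of the next, then trim trailing spaces.
join_lines_spaced() {
    tr '\n' ' ' | sed 's/ *$//'
}
```

Either function can be dropped into the pipeline in place of the tr stage, for example: flameshot gui --raw | tesseract stdin stdout | join_lines_spaced | xclip -in -selection clipboard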
This will enable you to use the script conveniently with a keyboard shortcut of your choice. Linux makes your digital life easier : ) |
Hi guys, this is my custom version for Linux Mint. I have been exploring methods to enhance the OCR output, and my first step was to define the language settings, which can also involve combinations. Through varying the flameshot gui --raw | tesseract stdin stdout -l eng+spa --psm 6 | xclip -in -selection clipboard It's noteworthy that the best performance is achieved using Tesseract version 5. There are various ways to download or even compile it. I'm attaching the simplest method, which is through the PPA: sudo add-apt-repository -y ppa:alex-p/tesseract-ocr-devel This PPA was sourced from the following link, which originates from the official Tesseract page: |
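A minimal sketch of how the language combination in the command above could be built from a list, assuming the matching language data packages are installed (tesseract --list-langs shows what is available). join_langs and ocr_multilang are hypothetical names; --psm 6 tells Tesseract to assume a single uniform block of text.

```shell
# Hypothetical helper: join language codes with '+', the form expected by -l.
join_langs() {
    local IFS='+'
    printf '%s\n' "$*"
}

# Hypothetical wrapper: OCR an image from stdin using the given languages.
ocr_multilang() {
    tesseract stdin stdout -l "$(join_langs "$@")" --psm 6
}
```

Usage would look like: flameshot gui --raw | ocr_multilang eng spa | xclip -in -selection clipboard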
+1 tesseract ocr should be a built-in functionality |
Here's a really good PR by @rsrdesarrollo 🎉 It checks for Tesseract in PATH, meaning the feature is opt-in and uses upstream updates rather than a statically linked library (which would inevitably become outdated). Here's how I'm using it in Sway (Wayland) with @DO-Ui's Tesseract workaround for small letters: bindsym --no-repeat $MOD+PRINT exec flameshot gui --raw | convert -resize 400% png:- png:- | tesseract -l eng stdin stdout | wl-copy |
How do I run flameshot with tesseract on Windows? |
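The "check for Tesseract in PATH" idea can also be sketched at the script level. This is an illustration of the opt-in approach only, not the code from the PR; have, missing_deps and ocr_upscaled are hypothetical names, and the ImageMagick call uses the input-first argument order rather than the order in the binding above.

```shell
# Hypothetical helper: succeed if a command is available in PATH.
have() {
    command -v "$1" >/dev/null 2>&1
}

# Hypothetical helper: print every requested command that is not in PATH.
missing_deps() {
    local cmd
    for cmd in "$@"; do
        have "$cmd" || echo "$cmd"
    done
}

# Hypothetical wrapper: upscale the capture before OCR (the small-letters
# workaround), but only if every tool is actually installed.
ocr_upscaled() {
    local missing
    missing="$(missing_deps flameshot convert tesseract wl-copy)"
    if [ -n "$missing" ]; then
        echo "OCR unavailable, missing commands: $missing" >&2
        return 1
    fi
    flameshot gui --raw \
        | convert png:- -resize 400% png:- \
        | tesseract -l eng stdin stdout \
        | wl-copy
}
```

With this shape the OCR step stays optional: users without Tesseract get a clear message instead of a broken pipeline.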
We need built-in OCR. Coming from someone familiar with ShareX and CleanShotX, I can see FlameShot has the potential to get there. Features like OCR, GIF recording, and video capture would make FlameShot a better tool. As someone new to Linux, I haven't found any screen capture tool that even comes close to ShareX and CleanShotX. |
As has been discussed, the dev team unanimously agreed that none of these is in the scope of a screenshot tool. We will only allow these via the plugin system.
I have provided some explanation here: |
This sounds good. Similar to what Flow Launcher has come up with. Instead of implementing features natively, they have implemented a plugin system. That way, the app itself isn't bloated with unnecessary features. You choose what you want to install. (It's a universal search engine) |
Sometimes I'm working with screenshots, and it would be awesome to make a selection around some text and send it off to, say, https://aws.amazon.com/rekognition/ for it to analyse and return the text to set my clipboard.