OCR to clipboard hook for selections #702

kaihendry · 2020-03-19T07:13:55Z

Sometimes I'm working with screenshots, and it would be awesome to make a selection around some text and send it off to, say, https://aws.amazon.com/rekognition/ for it to analyse and return the text to set my clipboard.

MyriaCore · 2020-09-04T15:03:19Z

OCR would be great honestly!

nsa · 2020-09-12T21:34:42Z

There is a powerful open-source OCR engine from Google called Tesseract: https://github.com/tesseract-ocr/tesseract. It would be super cool if the maintainers could integrate tesseract into flameshot as the author suggested. That way there would be no need for external API connections such as Amazon's, because tesseract works offline.

JScriber · 2020-10-08T09:00:47Z

There's a project called dpscreenocr that does the job, yet its selection is not as convenient and user-friendly as flameshot's.
OCR integrated into flameshot with a dpscreenocr-inspired configuration would be really great.

enoy19 · 2021-01-15T08:30:58Z

I implemented that feature and created a pull request: #1239

xeaon · 2021-03-31T13:14:57Z

@enoy19 after fixing your typo/copy-paste error in 8432d12, this compiles in Manjaro and the OCR works really great, thanks!

edit: you have to change the header to baseapi.h in src/tools/ocr/ocrtool.cpp#L22 as suggested in #1239 (comment)

kaihendry · 2021-05-22T12:34:35Z

https://news.ycombinator.com/item?id=27242392 discusses the topic in a macOS context.

Pheggas · 2021-11-10T15:24:36Z

Anything new on this feature request? It would be super cool for me. I often work with copying text, and this feature would be an absolute killer!

xeaon · 2021-11-10T16:36:53Z

@Pheggas If you are comfortable with compiling yourself, check my comment above.

I've been using this feature since then and it works great.

borgmanJeremy · 2021-11-10T16:45:09Z

I am okay to merge this feature if we can:

- Fix the merge request so it passes CI.
- Explain how to package / distribute the language packs on all platforms.

Pheggas · 2021-11-10T19:29:26Z

> @Pheggas If you are comfortable with compiling yourself, check my comment above.
>
> I'm using this feature since then and it works great.

I'd like to. But it would be my first compilation. I'll try to look at it. In case of failure, I'll write you 🙂

leoneivaw · 2021-11-21T18:51:02Z

I am anxiously waiting for this feature!
Thanks!

irfan798 · 2022-03-09T14:42:27Z

For anybody waiting for the functionality I found this blog post:
https://slint.github.io/blog/ocr-screenshot.html

If you install tesseract, it is basically a one-liner:

flameshot gui --raw | tesseract stdin stdout | xclip -in -selection clipboard
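
A slightly more defensive take on the one-liner above, as a minimal sketch. The helper names missing_tools and ocr_selection are made up for illustration and are not part of flameshot or tesseract; the pipeline itself is the one from this comment, with a check up front that the three tools are actually in PATH.

```shell
# Sketch only: missing_tools and ocr_selection are hypothetical helper names.

# Print every tool from the argument list that cannot be found in PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Capture a region, run OCR on it, and copy the text to the X11 clipboard.
ocr_selection() {
    missing=$(missing_tools flameshot tesseract xclip)
    if [ -n "$missing" ]; then
        # Join the missing names onto one line for the error message.
        printf 'missing tools: %s\n' "$(printf '%s' "$missing" | tr '\n' ' ')" >&2
        return 1
    fi
    flameshot gui --raw | tesseract stdin stdout | xclip -in -selection clipboard
}
```

Saved to a file, the script would end with a call to ocr_selection; bound to a keyboard shortcut, it behaves like the one-liner but reports a missing dependency instead of failing silently.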

ek1ng · 2022-05-22T06:53:47Z

> For anybody waiting for the functionality I found this blog post: https://slint.github.io/blog/ocr-screenshot.html
>
> If you install tesseract, it is basically a one liner
> flameshot gui --raw | tesseract stdin stdout | xclip -in -selection clipboard

It's an amazing solution! Thanks!

siva-sub · 2022-07-19T11:59:39Z

> For anybody waiting for the functionality I found this blog post: https://slint.github.io/blog/ocr-screenshot.html
>
> If you install tesseract, it is basically a one liner
> flameshot gui --raw | tesseract stdin stdout | xclip -in -selection clipboard

For those using Wayland, use this command instead:

flameshot gui --raw | tesseract stdin stdout | wl-copy

It requires wl-clipboard to be installed, together with tesseract and the respective tesseract language data package.

For me, on Arch Linux, the packages are:
tesseract
tesseract-data-eng
wl-clipboard

On Arch Linux they can be installed with:
sudo pacman -S tesseract tesseract-data-eng wl-clipboard
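
If the same script has to work on both X11 and Wayland, the clipboard tool can be picked at run time. This is a rough sketch, assuming that a non-empty WAYLAND_DISPLAY or XDG_SESSION_TYPE=wayland is a good enough signal for a Wayland session; clipboard_tool, to_clipboard and ocr_selection are hypothetical names.

```shell
# Sketch only: the helper names are hypothetical, and the session detection
# below is an assumption that may not hold on every desktop setup.

# Print "wl-copy" in a Wayland session and "xclip" otherwise.
clipboard_tool() {
    if [ -n "${WAYLAND_DISPLAY:-}" ] || [ "${XDG_SESSION_TYPE:-}" = "wayland" ]; then
        printf '%s\n' wl-copy
    else
        printf '%s\n' xclip
    fi
}

# Copy stdin to the clipboard using whichever tool was selected.
to_clipboard() {
    case "$(clipboard_tool)" in
        wl-copy) wl-copy ;;
        *) xclip -in -selection clipboard ;;
    esac
}

# Same pipeline as in the comments above, with the clipboard step swapped out.
ocr_selection() {
    flameshot gui --raw | tesseract stdin stdout | to_clipboard
}
```

Calling ocr_selection at the end of the script gives one command for both session types.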

xeaon · 2022-07-20T08:06:04Z

Last year, I spent a few hours adapting @enoy19's pull request #1239 to the newer codebase but rm -rf'ed the directory by accident.

Yesterday I did it again. You can find my fork here: xeaon/flameshot. After taking a screenshot, you can choose the OCR tool to copy the tesseract output to your clipboard. If it succeeds, a notification is shown.

As of now, you need to install the dependencies tesseract-ocr and leptonica manually. It won't build in CI, but I'm trying to fix that for Linux builds.

As mentioned before, I'm no C++/Qt dev, so this is pretty challenging. Feel free to contribute to my fork if you have the skills. This feature can be merged if we can:

- pass CI
- explain how to package / distribute the language packs on all platforms

as @borgmanJeremy mentioned earlier.

mmahmoudian · 2022-11-04T14:42:00Z

@pitfiend Thanks for your comment, but I am going to hide it. Here are a few points to clarify my reasoning:

- The software you suggested is platform-specific (Windows).
- If we implement a feature, our emphasis would be to implement it in a cross-platform way. There are some features that are Linux- and macOS-specific (like support of CLI), but Windows will never be our main focus unless the maintainers of the project collectively agree otherwise. For now, the best-case scenario for the C# software you suggested is a Windows-specific plugin.
- The OCR can easily be implemented with 3 lines of shell script (detailed explanation here: https://mehrad.ai/posts/20210702-extracting-payment-info-in-rasterized-invoice/#ocr ).

Nasreddine · 2023-12-15T08:52:06Z

This is my version in Ubuntu using custom shortcuts.

I've created a shell script named shot.sh and placed it in /home/me/apps/ (replace me with your username).
The script contains the following line:

flameshot gui --raw | tesseract stdin stdout | tr -d '\n' | xclip -in -selection clipboard

The addition of tr -d '\n' is specifically to remove line breaks from the output.

To utilize this script in Ubuntu, you can set up a custom keyboard shortcut as follows:

Open Settings.
Navigate to Keyboard > Keyboard Shortcuts.
Click on View and Customize Shortcuts.
Under Custom Shortcuts, click the + sign to add a new shortcut.
In the new shortcut setup, enter the following details:
- Name: flameshot_tesseract
- Command: sh /home/me/apps/shot.sh
- Shortcut: [Choose your preferred key combination]

This will enable you to use the script conveniently with a keyboard shortcut of your choice.
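
One thing to be aware of with tr -d '\n': deleting the line breaks outright also glues the last word of each line to the first word of the next. A possible variant of shot.sh, sketched here with the hypothetical helper names flatten_text and shot, turns line breaks into single spaces instead. It also strips form-feed characters, on the assumption that tesseract may emit one as a page separator.

```shell
# Sketch only: flatten_text and shot are hypothetical helper names.

# Read text on stdin and print it as a single line: form feeds are dropped,
# line breaks become spaces, runs of spaces are squeezed, and leading or
# trailing spaces are trimmed.
flatten_text() {
    tr -d '\f' | tr '\n' ' ' | sed 's/  */ /g; s/^ //; s/ $//'
}

# Same pipeline as shot.sh above, with flatten_text in place of tr -d '\n'.
shot() {
    flameshot gui --raw | tesseract stdin stdout | flatten_text | xclip -in -selection clipboard
}
```

With a call to shot as the last line, the file can be dropped in as shot.sh and bound to the same custom shortcut.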

Linux makes your digital life easier : )

Sheldonimo · 2024-01-06T22:23:20Z

Hi guys, this is my custom version for Linux Mint.

I have been exploring ways to improve the OCR output. My first step was to set the language, which can also be a combination of languages. By varying Tesseract's --psm parameter, I found that the value 6, corresponding to "Assume a single uniform block of text," gives the best results. Here is the complete command to experiment with:

flameshot gui --raw | tesseract stdin stdout -l eng+spa --psm 6 | xclip -in -selection clipboard
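
A combination such as eng+spa only works if both language packs are installed, so a check before the capture can save a confusing failure. The sketch below compares the requested languages against the output of tesseract --list-langs; missing_langs and ocr_with_langs are hypothetical helper names, and the default of eng is just an assumption for illustration.

```shell
# Sketch only: missing_langs and ocr_with_langs are hypothetical helper names.

# $1: language spec such as "eng+spa"
# $2: newline-separated list of installed languages
# Prints each requested language that is not in the installed list.
missing_langs() {
    printf '%s\n' "$1" | tr '+' '\n' | while IFS= read -r lang; do
        printf '%s\n' "$2" | grep -qxF -e "$lang" || printf '%s\n' "$lang"
    done
}

# Run the command above, but refuse to start if a language pack is missing.
ocr_with_langs() {
    langs=${1:-eng}
    # Capture stderr as well, in case the list is printed there.
    installed=$(tesseract --list-langs 2>&1)
    missing=$(missing_langs "$langs" "$installed")
    if [ -n "$missing" ]; then
        printf 'missing tesseract language data: %s\n' "$(printf '%s' "$missing" | tr '\n' ' ')" >&2
        return 1
    fi
    flameshot gui --raw | tesseract stdin stdout -l "$langs" --psm 6 | xclip -in -selection clipboard
}
```

For example, ocr_with_langs eng+spa reproduces the command from this comment once both packs are present.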

It's noteworthy that the optimal performance is achieved using Tesseract version 5. There are various ways to download or even compile it. I'm attaching the simplest method, which is through the PPA:

sudo add-apt-repository -y ppa:alex-p/tesseract-ocr-devel

This PPA was sourced from the following link, which originates from the official Tesseract page:

erfanium · 2024-01-28T10:53:16Z

+1 tesseract ocr should be a built-in functionality

GrabbenD · 2024-04-21T13:35:57Z

> +1 tesseract ocr should be a built-in functionality

Here's a really good PR by @rsrdesarrollo 🎉
#3074

It checks for Tesseract in PATH, meaning the feature is opt-in and uses upstream updates rather than a statically linked library (which would inevitably become outdated).

Here's how I'm using it in Sway (Wayland) with @DO-Ui's Tesseract workaround for small letters:

bindsym --no-repeat $MOD+PRINT exec flameshot gui --raw | convert -resize 400% png:- png:- | tesseract -l eng stdin stdout | wl-copy
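
For experimenting with the upscaling factor outside the Sway config, the same pipeline can live in a small script. This is only a sketch: scale_percent and ocr_upscaled are hypothetical names, and the default factor of 4 simply mirrors the 400% above. On ImageMagick 7 the command is called magick, and convert may only be available as a legacy alias.

```shell
# Sketch only: scale_percent and ocr_upscaled are hypothetical helper names.

# Turn a positive integer factor (e.g. 4) into an ImageMagick percentage
# (400%). Anything else, including values with a leading zero, is rejected.
scale_percent() {
    case "$1" in
        ''|*[!0-9]*|0*) return 1 ;;
    esac
    printf '%s%%\n' "$(( $1 * 100 ))"
}

# Capture a region, enlarge it, run OCR, and copy the text on Wayland.
ocr_upscaled() {
    percent=$(scale_percent "${1:-4}") || return 1
    flameshot gui --raw \
        | convert -resize "$percent" png:- png:- \
        | tesseract -l eng stdin stdout \
        | wl-copy
}
```

Calling ocr_upscaled 3 instead of ocr_upscaled 4 is then enough to compare results for small text at different factors.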

X88R88 · 2024-05-08T06:03:57Z

How do I run flameshot with tesseract on Windows?

f-x1-1 · 2024-05-09T05:01:15Z

We need built-in OCR. Coming from someone familiar with ShareX and CleanShotX, I can see FlameShot has the potential to get there. Features like OCR, GIF recording, and video capture would make FlameShot a better tool. As someone new to Linux, I haven't found any screen capture tool that even comes close to ShareX and CleanShotX.

mmahmoudian · 2024-05-09T07:42:04Z

@f-x1-1

> Features like OCR, GIF recording, and video capture would make FlameShot a better tool

As has been discussed, the dev team unanimously agreed that none of these are in the scope of a screenshot tool. We will allow them only via the plugin system.

@X88R88

> How do i run flameshot with tesseract on windows?

I have provided some explanation here:
#702 (comment)

f-x1-1 · 2024-05-09T12:02:55Z

> @f-x1-1
>
> > Features like OCR, GIF recording, and video capture would make FlameShot a better tool
>
> As it have been discussed, the dev team unanimously agreed that none of these is in the scope of a screenshot tool. We will only and only allow these via the plugin system.
>
> @X88R88
>
> > How do i run flameshot with tesseract on windows?
>
> I have provided some explanation here:
> #702 (comment)

This sounds good. Similar to what Flow Launcher has come up with. Instead of implementing features natively, they have implemented a plugin system. That way, the app itself isn't bloated with unnecessary features. You choose what you want to install. (It's a universal search engine)

Martin-Eckleben added the discussion and Enhancement labels Sep 4, 2020

borgmanJeremy mentioned this issue Feb 18, 2021

Easy OCR #1344

Closed

unikzforce mentioned this issue Feb 13, 2022

Google Lens Screenshot Translation overlay #2420

Closed

mmahmoudian added the Plugin label Mar 15, 2022

mmahmoudian mentioned this issue Jul 21, 2022

[feature request] text recognition from clipping #2814

Closed

This comment was marked as off-topic.

rsrdesarrollo pushed a commit to rsrdesarrollo/flameshot that referenced this issue Jan 27, 2023

fix flameshot-org#702 add OCR with tesseract

9af54b3

rsrdesarrollo linked a pull request Jan 27, 2023 that will close this issue

Add OCR acction with tessercat #3074

Open

R8s6 mentioned this issue Mar 14, 2024

1-line command to enter "Take Screenshot"? RajSolai/TextSnatcher#26

Closed
