-
Hi, Awesome work with Whisper and many thanks for putting this out there. It definitely will advance a lot of speech-related applications. I'm building a smartglasses tool that helps people with a visual impairment and one of the most preferred forms of interaction for blind users is through voice. I would like to explore Whisper to run on our Android-based Glasses so want to check If there's an Android port in the works? If not, any high-level ideas on how we can adopt this model on run on mobile? Thanks! |
Beta Was this translation helpful? Give feedback.
Replies: 19 comments 23 replies
-
We haven't tried mobile applications, but you could try TorchScript + PyTorch mobile or ONNX export + ONNX mobile runtime. I expect some difficulties porting the decoding logic, if the mobile environment does not allow running Python. |
Beta Was this translation helpful? Give feedback.
-
Build something on top of this to run on Android |
Beta Was this translation helpful? Give feedback.
-
Refer below for tflite model files |
Beta Was this translation helpful? Give feedback.
-
@zhuzilin @ArtyomZemlyak |
Beta Was this translation helpful? Give feedback.
-
I got Whisper working on iOS (android is probably easier) by converting the (small) model to CoreML packages in python with the coremltools convert function, as well as writing quite a bit of Swift to them in my scenario. It's not super fast, and the app size is around 500MB using the small model; I haven't looked at the issues on older iOS devices yet (crashing). A few considerations depend on what you're trying to achieve - there's the speech framework on iOS, for instance (but it recommends an internet connection) which may be faster, more real time, support older devices, reduce app size.... You might want to consider https://github.com/snipsco/snips-platform-android-demo as an option reading your case - just throwing it out there for evaluation Here's a fun little post about my adventure so far but happy to have a chat to see if I can help: https://sandigital.uk/blog/openai-whisper-scottish-accent/ |
Beta Was this translation helpful? Give feedback.
-
encoder/decoder tflite(int8) models can be generated with the below script |
Beta Was this translation helpful? Give feedback.
-
here is the colab link to generate whisper.tflite file Updated input shapes using input signature to create a new hybrid whisper.tflite 40MB model (activations are in float32, weights are in int8). I was then able to execute inference with dummy input on whisper.tflite using minimal example from TFLite C++ API on a Linux PC. run the below command to run inference on whisper.tflite Next steps:Feed the audio input->generate spectrograms->feed to model->post process model output in c++ code |
Beta Was this translation helpful? Give feedback.
-
Please follow the README to run whisper.tflite using tflite framework |
Beta Was this translation helpful? Give feedback.
-
In this repository you can find a Flutter port of Whisper: https://github.com/azkadev/whisper_dart |
Beta Was this translation helpful? Give feedback.
-
I developed Android APP based on whisper.tflite (quantized ~40MB) running inference in ~2 seconds for 30 seconds audio clip |
Beta Was this translation helpful? Give feedback.
-
I released Whisper Android App based on Whisper.tflite ~40MB quantized model to the Android App store. Link to Android app store |
Beta Was this translation helpful? Give feedback.
-
This is only a proof-of-concept project to create an Android app based on Whisper TFLite, which leverages the stock Android UI to show off its features. |
Beta Was this translation helpful? Give feedback.
-
https://apps.apple.com/in/app/whisper-asr/id6444556326 Pls download iOS app from apple app store which uses whisper tflite model |
Beta Was this translation helpful? Give feedback.
-
Please feel free to download the openai/whisper-tiny tflite-based Android APP from Google App Store. |
Beta Was this translation helpful? Give feedback.
-
Hey @nyadla-sys, I just came across your application it's awesome, and I want to integrate it into a react native application. Can u plz give me some clues on how to integrate it? |
Beta Was this translation helpful? Give feedback.
-
You may find the source code for a Linux C++ application and an Android Java app on my Github repository at the following link: https://github.com/usefulsensors/openai-whisper. |
Beta Was this translation helpful? Give feedback.
-
hello, Why does the whisper_android app run above android 9.0? |
Beta Was this translation helpful? Give feedback.
-
Which app are you referring to specifically? Please share the link.
…On Sat, 03 Feb, 2024, 11:10 slurpey, ***@***.***> wrote:
I would be definitely willing to pay for your app if it had an automated
copy of the text to the clipboard or a big copy button at the bottom, so I
could use it as a replacement for Google Voice Typing.
—
Reply to this email directly, view it on GitHub
<#11 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AGBLNUDAYXZY2TTD2JCNAU3YRX5IZAVCNFSM6AAAAAAQSNDAMCVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM4DGNJTGYZDA>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
i dont know do anyown want to use whisper using web browser . and your pc as server https://gitlab.com/krafi/whisperweb |
Beta Was this translation helpful? Give feedback.
We haven't tried mobile applications, but you could try TorchScript + PyTorch mobile or ONNX export + ONNX mobile runtime. I expect some difficulties porting the decoding logic, if the mobile environment does not allow running Python.