-
Notifications
You must be signed in to change notification settings - Fork 121
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Added example: meeting conversations extractor #1009
base: main
Are you sure you want to change the base?
Added example: meeting conversations extractor #1009
Conversation
This reverts commit 570371e.
…o meeting_conversations_extractor
@Ashish-Abraham How is this going? Are you blocked on antyhing? |
No issues @diptanu . Was a little busy. Will complete it soon. Thanks! |
@Ashish-Abraham Did you see our example here - https://github.com/tensorlakeai/indexify/tree/main/examples/video_summarization -- I am wondering what is the difference in this demo vs what's on there? |
Sorry. Added the wrong file. Here we are extracting the summary in structured format defined by the schema of each meeting type. This data structure can be passed to the frontend or processed further in any manner required. Please check. Should I convert to JSON or sth? |
@Ashish-Abraham Yeah if you use JSON it might be easier for people to consume the workflow using HTTP APIs directly. Add After that you could do something like to invoke the workflow
and
I don't quite remember the APIs correctly, they are in code and on our website. |
I cant find the page you are referring to. Is this the page? https://docs.tensorlake.ai/api-reference/documents/extract/extract-file-sync. Could you please guide me a bit on how to do this? |
Context
Valuable business insights are often hidden in daily conversations across organizations, from customer interactions to internal meetings. Had an idea to develop something using Indexify that helps extract and utilize this data effectively.
Here I have added an example use-case. This is a conversation extractor that uses a custom indexify extraction graph to extract summarized data in a structured format from long meeting audio files.
What
The extractor workflow is as given:
Audio Processing:
Content Analysis:
Based on the meeting type classification, the system generates structured summaries:
You can tweak the fields to extract whatever data needed.
Sample Outputs
Testing
Local Installation - In Process
Clone this repository:
Create a virtual environment and activate it:
Install the required dependencies:
Run the main script:
Contribution Checklist
make fmt
inpython-sdk/
.make fmt
inserver/
.