[Model] Extend Ultravox to accept audio longer than 30s #13631
Merged
30 commits
cc2df56 update ultravox to accept more than 30s audio (farzadab)
c7e0329 temporarily use model with updated processor for tests (farzadab)
0c5363e fix collation (farzadab)
189f5cc revert audio_replacement -> audio_token_replacement (farzadab)
bc3ba8c increase max mm tokens (farzadab)
618e752 increase max mm tokens (farzadab)
0e62945 reduce max mm tokens (farzadab)
69278e2 revert increasing max mm tokens (farzadab)
788fd59 Merge remote-tracking branch 'upstream/main' into farzad-long-audio (farzadab)
75c138b fix <|begin_of_text|> not being included (farzadab)
3b0e237 batching for whisper to avoid oom (farzadab)
97f6f5b add comment (farzadab)
bea5a31 use flat_from_sizes for ultravox mm_fields_config (farzadab)
28f16ce revert ultravox test model id (farzadab)
48c359b improve documentation for double bos_id case (farzadab)
e920ab9 Merge remote-tracking branch 'upstream/main' into farzad-long-audio (farzadab)
4a54ea1 do not use vocab in get_hf_processor (farzadab)
347ada8 revert tests to use v0_5 (farzadab)
e829dac Merge remote-tracking branch 'upstream/main' into farzad-long-audio (farzadab)
b04878e revert tests to use v0_5 (farzadab)
631487f adding tests for both ultravox v0.4 and v0.5 (farzadab)
a9828ea handle audio_num_chunks when no audio is passed (farzadab)
33a9cf0 drop test for ultravox v0_4 (farzadab)
7ca61cf drop matching Ultravox audio_features with cache (farzadab)
48f7da3 ignore exact match for audio_features in _items_by_modality (farzadab)
2813a47 fix type hint (farzadab)
66c10e4 Merge remote-tracking branch 'vllm-base/main' into farzad-long-audio (farzadab)
11ff27f debug logs for ci (farzadab)
2776a31 if all else fails just stack? (farzadab)
3cb3583 recursive pad_and_concat (farzadab)