Refactor inference processes & add new engines (FasterWhisper, vLLM) #141

ssh-meister · 2025-07-06T09:57:37Z

Description

Refactored inference-related processes into a separate group, mirroring the subgroup structure used in the NeMo repository (subgroup -> task, e.g., "asr", "nlp", etc.).
Within each subgroup, the processors are further organized by the type of engine required to run them.

New processors added:

FasterWhisperInference — based on SYSTRAN/faster-whisper
vLLMInference — based on vllm-project/vllm

New post-processing processors:

DetectWhisperHallucinationFeatures
CleanQwenGeneration

Misc:

Fixed docs build issues from Portuguese #77

Signed-off-by: Sasha Meister <[email protected]>

sdp/processors/inference/asr/post_processing/whisper_hallucinations.py

lilithgrigoryan · 2025-07-21T14:42:52Z

sdp/processors/inference/llm/post_processing/qwen_cleaning.py

+        Determine if generation should be replaced with reference text based on
+        CER and uppercase ratio.
+        """
+        chars = generation.replace(' ', '')


Why do we need this chars here? is it necessary to remove blanks?

@lilithgrigoryan, thanks for the review!
This processor is used to select either the original text or a Qwen generation with restored punctuation.
One of the selection criteria is that if the model over-capitalizes the text (above a specified upper_case_threshold), we consider the generation poor.
To check this, we look only at non-space characters to compute the ratio of capital to lowercase letters.

sdp/processors/inference/asr/faster_whisper/faster_whisper_inference.py

tests/test_data_to_data.py

Signed-off-by: Sasha Meister <[email protected]>

ssh-meister added 2 commits July 6, 2025 09:50

Group inference processors

4c9a278

Signed-off-by: Sasha Meister <[email protected]>

Add requirements

e304efd

Signed-off-by: Sasha Meister <[email protected]>

ssh-meister requested a review from lilithgrigoryan July 6, 2025 09:57

ssh-meister self-assigned this Jul 6, 2025

ssh-meister mentioned this pull request Jul 6, 2025

Granary Dataset Processing (Component-Based) #135

Open

ssh-meister and others added 6 commits July 6, 2025 10:05

Remove outdated import

2d2127c

Signed-off-by: Sasha Meister <[email protected]>

optional to py:class

1d2efdb

Signed-off-by: Sasha Meister <[email protected]>

Merge branch 'main' into Inference

c58f37a

Estimate bandwith moved to data_to_data.py

3d8530f

Signed-off-by: Sasha Meister <[email protected]>

Update link in docs

4cf80a1

Signed-off-by: Sasha Meister <[email protected]>

Merge remote-tracking branch 'origin/main' into Inference

6c96e97

Signed-off-by: Sasha Meister <[email protected]>

lilithgrigoryan requested changes Jul 21, 2025

View reviewed changes

tests/test_data_to_data.py Show resolved Hide resolved

ssh-meister added 3 commits July 22, 2025 01:16

Missed level of folders added (nlp/nemo)

4941f16

Signed-off-by: Sasha Meister <[email protected]>

Changes addressing the reviewer’s comments

fd6fe10

Signed-off-by: Sasha Meister <[email protected]>

fix docs

672d500

Signed-off-by: Sasha Meister <[email protected]>

ssh-meister requested a review from lilithgrigoryan July 22, 2025 11:04

lilithgrigoryan approved these changes Jul 22, 2025

View reviewed changes

ssh-meister added 2 commits July 23, 2025 01:32

Merge remote-tracking branch 'origin/main' into Inference

9071b07

Signed-off-by: Sasha Meister <[email protected]>

Fixed import of rttm processors

c7e319d

Signed-off-by: Sasha Meister <[email protected]>

ssh-meister merged commit 93cfc46 into main Jul 23, 2025
10 checks passed

ssh-meister deleted the Inference branch July 23, 2025 10:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor inference processes & add new engines (FasterWhisper, vLLM) #141

Refactor inference processes & add new engines (FasterWhisper, vLLM) #141

Uh oh!

ssh-meister commented Jul 6, 2025 •

edited

Loading

Uh oh!

Uh oh!

lilithgrigoryan Jul 21, 2025

Uh oh!

ssh-meister Jul 22, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Refactor inference processes & add new engines (FasterWhisper, vLLM) #141

Refactor inference processes & add new engines (FasterWhisper, vLLM) #141

Uh oh!

Conversation

ssh-meister commented Jul 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

New processors added:

New post-processing processors:

Misc:

Uh oh!

Uh oh!

lilithgrigoryan Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

ssh-meister Jul 22, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ssh-meister commented Jul 6, 2025 •

edited

Loading