Make model blocks compatible with any frame-wise or chunk-wise model #43

juanmc2005 · 2022-05-04T08:23:28Z

Problem

The current implementation of FrameWiseModel and ChunkWiseModel is highly dependent on pyannote.audio internals and published models. Any model accepted by the pipeline must be loaded with pyannote.audio. This is a problem because users may want to plug in custom models that don't share the same structure as models trained with or published by pyannote.

Idea

Make FrameWiseModel and ChunkWiseModel accept a broad definition of a model, e.g. a Callable[[torch.Tensor], torch.Tensor] or something similar. This way, any model that can ingest a tensor (in a certain format) and return another tensor (also in a certain format) could be plugged in.

The text was updated successfully, but these errors were encountered:

juanmc2005 · 2022-06-09T12:18:25Z

Implemented in #61

juanmc2005 added the feature New feature or request label May 4, 2022

juanmc2005 added this to the Version 0.4 milestone May 11, 2022

juanmc2005 mentioned this issue Jun 8, 2022

Make pyannote.audio optional (still mandatory to run default pipeline) #61

Merged

juanmc2005 closed this as completed Jun 9, 2022

This was referenced Jun 17, 2022

SpeechBrain embedding compatibility #34

Closed

Version 0.4 #66

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make model blocks compatible with any frame-wise or chunk-wise model #43

Make model blocks compatible with any frame-wise or chunk-wise model #43

juanmc2005 commented May 4, 2022

juanmc2005 commented Jun 9, 2022

Make model blocks compatible with any frame-wise or chunk-wise model #43

Make model blocks compatible with any frame-wise or chunk-wise model #43

Comments

juanmc2005 commented May 4, 2022

Problem

Idea

juanmc2005 commented Jun 9, 2022