Modal Configuration¶

`modal_processing` ¶

Modal integration of Mirumoji API.

This module defines the modal integration and the code to be run in Modal's container.

Attributes:

Name	Type	Description
`LOGGER`	`Logger`	Module's logging object.

Runs Whisper transcription on media_fp, fixes with GPT, and returns SRT string.

Parameters:

Name	Type	Description	Default
`media_fp`	`Union[str, Path]`	Path to the video for transcription.	required
`fwhisper_kwargs`	`dict`	Additional arguments for `FWhisperWrapper`	`{}`
`transcribe_kwargs`	`dict`	Additional arguments for `FWhisperWrapper.transcribe`	`{}`
`fix_with_chat_gpt`	`bool`	If `True` request ChatGPT to fix transcription. Defaults to True	`True`

Returns:

Type	Description
`Optional[str]`	Transcription in form of SRT string.

Transcribe audio to string using Faster Whisper.

Parameters:

Name	Type	Description	Default
`audio_fp`	`Union[str, Path]`	Path to the audio for transcription.	required
`fwhisper_kwargs`	`dict`	Additional arguments for `FWhisperWrapper`	`{}`
`transcribe_kwargs`	`dict`	Additional arguments for `FWhisperWrapper.transcribe`	`{}`

Returns:

Name	Type	Description
`str`	`str`	Transcription in form of string.

Converts video_fp to MP4 using NVENC and returns the video content as bytes.

Parameters:

Name	Type	Description	Default
`video_fp`	`Union[str, Path]`	Path to the video for conversion.	required
`to_mp4_kwargs`	`dict`	Additional arguments for `AudioTools.to_mp4`.	`{}`

Yields:

Name	Type	Description
`bytes`	`bytes`	The converted video chunks.