multimodal_fin package
Subpackages
- multimodal_fin.embeddings package
- Subpackages
- multimodal_fin.embeddings.builder package
- Submodules
- multimodal_fin.embeddings.builder.conference_encoder module
- multimodal_fin.embeddings.builder.feature_extractor module
- multimodal_fin.embeddings.builder.node_encoder module
- multimodal_fin.embeddings.builder.pipeline module
- multimodal_fin.embeddings.builder.sentence_attention_encoder module
- multimodal_fin.embeddings.builder.transformer_encoder module
- Module contents
- multimodal_fin.embeddings.speech_tree package
- multimodal_fin.embeddings.visualizer package
- multimodal_fin.embeddings.builder package
- Module contents
- Subpackages
- multimodal_fin.processing package
- Subpackages
- multimodal_fin.processing.metadata package
- Submodules
- multimodal_fin.processing.metadata.coherence_analyzer module
- multimodal_fin.processing.metadata.metadata_enricher module
- multimodal_fin.processing.metadata.prompt_builder module
- multimodal_fin.processing.metadata.qa_analyzer module
- multimodal_fin.processing.metadata.sec10k_analyzer module
- Module contents
- multimodal_fin.processing.multimodal package
- multimodal_fin.processing.preprocessing package
- Submodules
- multimodal_fin.processing.preprocessing.ensemble_classifier module
- multimodal_fin.processing.preprocessing.monologue_classifier module
- multimodal_fin.processing.preprocessing.preprocessor module
- multimodal_fin.processing.preprocessing.qa_classifier module
- multimodal_fin.processing.preprocessing.transcript_preprocessor module
- Module contents
- multimodal_fin.processing.metadata package
- Submodules
- multimodal_fin.processing.basics module
- multimodal_fin.processing.pipeline module
- multimodal_fin.processing.processor module
- Module contents
- Subpackages
- multimodal_fin.runners package
- multimodal_fin.utils package
Submodules
multimodal_fin.cli module
Command-line interface for the multimodal_fin package.
This script defines the main CLI entry points using Typer, allowing users to: - Process conference data - Generate embeddings - Download transcripts and audio
Each command loads its corresponding configuration section from a YAML file.
- multimodal_fin.cli.download(config_file, config_name='default', url=None)[source]
Download transcripts and audio from EarningsCall.biz for S&P500 companies.
- Parameters:
config_file (
Path) – Path to the YAML configuration file.config_name (
str) – Name of the config block under ‘conferences_data_adquisition’. Defaults to “default”.url (
str) – Optional override of the default S&P500 earnings call URL.
- Return type:
None
- multimodal_fin.cli.embed(config_file, config_name='default', json_path=None, json_csv=None)[source]
Generate hierarchical multimodal embeddings from enriched JSON files.
- Parameters:
config_file (
Path) – Path to the YAML configuration file.config_name (
str) – Name of the config block under ‘embeddings_pipeline’. Defaults to “default”.json_path (
Path) – Path to a single transcript.json file.json_csv (
Path) – Path to a CSV containing paths to multiple transcript.json files.
- Return type:
None
- multimodal_fin.cli.main()[source]
Main entry point for the CLI when invoked directly.
- Return type:
None
- multimodal_fin.cli.process(config_file, config_name='default')[source]
Run the full pipeline: QA/monologue classification and enrichment.
- Parameters:
config_file (
Path) – Path to the YAML configuration file.config_name (
str) – Name of the config block under ‘conferences_processing’. Defaults to “default”.
- Return type:
None
multimodal_fin.config module
Configuration loader and schema definitions for the multimodal_fin package.
This module provides Pydantic-based configuration classes and utilities to load structured settings from YAML files.
- class multimodal_fin.config.ConferenceEncoderParams(**data)[source]
Bases:
BaseModelModel parameters for the conference-level encoder.
- d_output: int
- input_dim: int
- model_config: ClassVar[ConfigDict] = {}
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].
- n_heads: int
- weights_path: str
- class multimodal_fin.config.DataAdquisitionSettings(**data)[source]
Bases:
BaseModelSettings for downloading earnings call data.
- api_key: str
- base_path: str
- model_config: ClassVar[ConfigDict] = {}
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].
- url: str
- class multimodal_fin.config.EmbeddingsPipelineSettings(**data)[source]
Bases:
BaseModelSettings for the full embedding pipeline.
- conference_encoder: ConferenceEncoderParams
- device: str | None
- model_config: ClassVar[ConfigDict] = {}
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].
- node_encoder: NodeEncoderParams
- class multimodal_fin.config.FullConfig(**data)[source]
Bases:
BaseModelAggregated configuration object including all pipeline components.
- data_adquisition: DataAdquisitionSettings | None
- embeddings: EmbeddingsPipelineSettings | None
- model_config: ClassVar[ConfigDict] = {}
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].
- class multimodal_fin.config.NodeEncoderParams(**data)[source]
Bases:
BaseModelModel parameters for the node-level encoder.
- d_output: int
- meta_dim: int
- model_config: ClassVar[ConfigDict] = {}
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].
- n_heads: int
- weights_path: str
- class multimodal_fin.config.Settings(**data)[source]
Bases:
BaseModelSettings for the full conference processing pipeline.
- audio_model: str | None
- device: str
- evals: int
- input_csv_path: str
- model_config: ClassVar[ConfigDict] = {}
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].
- monologue_models: List[str]
- qa_analyzer_models: List[str]
- qa_models: List[str]
- sec10k_models: List[str]
- text_model: str | None
- verbose: int
- video_model: str | None
- multimodal_fin.config.default_device()[source]
Return the default device based on PyTorch availability.
- Returns:
‘cuda’ if a CUDA-enabled GPU is available, otherwise ‘cpu’.
- Return type:
str
- multimodal_fin.config.load_full_config(config_path, config_name='default', override_url=None)[source]
Load all pipeline configuration components from YAML.
- Parameters:
config_path (
str) – Path to the YAML configuration file.config_name (
str) – Name of the config block for each section. Defaults to “default”.override_url (
Optional[str]) – Optional override for the download URL.
- Returns:
Aggregated configuration object.
- Return type: