Documentation

Reference

Python API
Toggle navigation of Python API
- psifx
  Toggle navigation of psifx
  - audio
    Toggle navigation of audio
    - command
    - diarization
      Toggle navigation of diarization
      - command
      - pyannote
        Toggle navigation of pyannote
        
        command
        
        tool
      - tool
    - identification
      Toggle navigation of identification
      - command
      - pyannote
        Toggle navigation of pyannote
        
        command
        
        tool
      - tool
    - manipulation
      Toggle navigation of manipulation
      - command
      - tool
    - speech
      Toggle navigation of speech
      - command
      - opensmile
        Toggle navigation of opensmile
        
        command
        
        tool
      - tool
    - tool
    - transcription
      Toggle navigation of transcription
      - command
      - tool
      - whisper
        Toggle navigation of whisper
        
        command
        
        tool
  - command
  - io
    Toggle navigation of io
    - csv
    - json
    - rttm
    - tar
    - txt
    - video
    - vtt
    - wav
    - yaml
  - text
    Toggle navigation of text
    - chat
      Toggle navigation of chat
      - command
      - tool
    - command
    - instruction
      Toggle navigation of instruction
      - command
      - tool
    - llm
      Toggle navigation of llm
      - anthropic
        Toggle navigation of anthropic
        
        tool
      - command
      - hf
        Toggle navigation of hf
        
        tool
      - ollama
        Toggle navigation of ollama
        
        tool
      - openai
        Toggle navigation of openai
        
        tool
      - tool
    - tool
  - tool
  - utils
    Toggle navigation of utils
    - command
    - draw
  - video
    Toggle navigation of video
    - command
    - face
      Toggle navigation of face
      - command
      - openface
        Toggle navigation of openface
        
        command
        
        fields
        
        skeleton
        
        tool
      - tool
    - manipulation
      Toggle navigation of manipulation
      - command
      - tool
    - pose
      Toggle navigation of pose
      - command
      - mediapipe
        Toggle navigation of mediapipe
        
        command
        
        skeleton
        
        tool
      - tool
    - tool
    - tracking
      Toggle navigation of tracking
      - command
      - samurai
        Toggle navigation of samurai
        
        command
        
        tool
      - tool
Commands
Toggle navigation of Commands
- audio
- video
- text

Usage¶

Concept¶

psifx is a versatile Python package that can be used both as a library within Python code or as a command-line tool for direct execution.

Library Usage¶

As a library, psifx provides tools for various tasks, including audio processing, video manipulation, and text processing. For example, you can use it for speaker diarization in Python by importing the necessary modules and specifying parameters programmatically:

from psifx.audio.diarization.pyannote.tool import PyannoteDiarizationTool

# Configure a tool with specific settings, such as selecting a neural network model.
tool = PyannoteDiarizationTool(model_name="pyannote/speaker-diarization-3.1")

# Run the inference method on an audio track for speaker segmentation.
tool.inference(audio_path="/path/to/audio.wav", 
               diarization_path="/path/to/diarization.rttm", 
               num_speakers=2)

Command-Line Interface (CLI)¶

psifx also includes a powerful CLI for running commands directly in the terminal. This can simplify workflows by allowing users to specify tasks and parameters without writing any Python code.

For example, to run speaker diarization on an audio file:

psifx audio diarization pyannote inference \
    --audio /path/to/audio.wav \
    --diarization /path/to/diarization.rttm \
    --num_speakers 2

Additional Resources¶

For detailed command usage in specific areas, see:

These guides provide in-depth instructions for working with different data types in psifx, from pre-processing to inference and visualization.