paddlespeech.vector.io package

Submodules

  • paddlespeech.vector.io.augment module
    • AddBabble
      • AddBabble.forward()
    • AddNoise
      • AddNoise.forward()
    • AddReverb
      • AddReverb.forward()
    • DropChunk
      • DropChunk.forward()
    • DropFreq
      • DropFreq.forward()
    • EnvCorrupt
      • EnvCorrupt.forward()
    • Resample
      • Resample.forward()
    • SpeedPerturb
      • SpeedPerturb.forward()
    • TimeDomainSpecAugment
      • TimeDomainSpecAugment.forward()
    • build_augment_pipeline()
    • waveform_augment()
  • paddlespeech.vector.io.batch module
    • batch_feature_normalize()
    • batch_pad_right()
    • feature_normalize()
    • pad_right_2d()
    • pad_right_to()
    • waveform_collate_fn()
  • paddlespeech.vector.io.dataset module
    • CSVDataset
      • CSVDataset.convert_to_record()
      • CSVDataset.load_data_csv()
      • CSVDataset.load_speaker_to_label()
    • meta_info
      • meta_info.duration
      • meta_info.label
      • meta_info.start
      • meta_info.stop
      • meta_info.utt_id
      • meta_info.wav
  • paddlespeech.vector.io.dataset_from_json module
    • JSONDataset
    • meta_info
      • meta_info.duration
      • meta_info.record_id
      • meta_info.start
      • meta_info.stop
      • meta_info.utt_id
      • meta_info.wav
  • paddlespeech.vector.io.embedding_norm module
    • InputNormalization
      • InputNormalization.save()
      • InputNormalization.spk_dict_count
      • InputNormalization.spk_dict_mean
      • InputNormalization.spk_dict_std
      • InputNormalization.to()
  • paddlespeech.vector.io.signal_processing module
    • blackman_window()
    • compute_amplitude()
    • convolve1d()
    • dB_to_amplitude()
    • normalize()
    • notch_filter()
    • rescale()
    • reverberate()
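
The index above lists names only, with no signatures or behavior. As a rough illustration of the kind of operation that names such as dB_to_amplitude() and batch_pad_right() suggest, here is a minimal pure-Python sketch. It is not the PaddleSpeech implementation: the function names `db_to_amplitude_sketch` and `pad_right_sketch`, their parameters, and their return values are hypothetical, and the decibel conversion assumes the common amplitude convention dB = 20 * log10(ratio). Consult the individual module pages for the actual signatures.

```python
def db_to_amplitude_sketch(db):
    """Convert decibels to a linear amplitude ratio.

    Assumes the amplitude convention dB = 20 * log10(ratio);
    this is an illustrative helper, not the library function.
    """
    return 10.0 ** (db / 20.0)


def pad_right_sketch(sequences, pad_value=0.0):
    """Right-pad variable-length sequences to the longest length.

    Returns the padded sequences and each sequence's length relative
    to the longest one (hypothetical return shape, for illustration).
    """
    if not sequences:
        raise ValueError("expected at least one sequence")
    max_len = max(len(s) for s in sequences)
    padded = [list(s) + [pad_value] * (max_len - len(s)) for s in sequences]
    # Guard against a batch made up only of empty sequences.
    ratios = [len(s) / max_len if max_len else 0.0 for s in sequences]
    return padded, ratios


if __name__ == "__main__":
    print(db_to_amplitude_sketch(0.0))
    print(pad_right_sketch([[1, 2, 3], [4]]))
```

The relative lengths are returned alongside the padded batch so that downstream code can mask out the padding; whether the library exposes the same information is an assumption here.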

© Copyright 2021, paddlespeech-developers. Revision d8bf8c6f.
