speech

中国大学MOOC--英语演讲技巧与实训 speech

中国大学MOOC--英语演讲技巧与实训【来源： | 发布日期：2023-05-24】课程概述张春敏教授（Alice Zhang），主讲《基础英语公众演讲》、《高级英语公众演讲》和《公众演讲与口译》课程，中南大学英语演讲比赛、口译比赛培训金牌教练，从事英语演讲与口译教学研究与竞赛培训十余年，20 ......

技巧 speech 大学 MOOC更新时间 2024-01-06

ChatGPT 实时语音交流, speech-to-text and text-to-speech

前言如果期望与 ChatGPT 进行实时的语音交流，可以直接使用 ChatGPT 的 APP 就可以了，本文完。😂 当然，这需要每月 20 美刀。如果只是想偶尔使用，似乎用 API 的方式更划算。应该有已经封装好的，可以直接调用 API 进行实时语音交流的工具，暂时没找到满意的，求推荐。 sp ......

speech text speech-to-text text-to-speech 实时更新时间 2023-11-14

react native 使用 Expo Speech 文字转语音

安装： npx expo install expo-speech 引入使用： import * as React from 'react'; import { View, StyleSheet, Button } from 'react-native'; import * as Speech fro ......

语音文字 native Speech react更新时间 2023-11-09

语音合成技术5：Disentanglement in a GAN for Unconditional Speech Synthesis

Disentanglement in a GAN for Unconditional Speech Synthesis 在无条件语音合成中的GAN解缠摘要— 我们是否可以开发一个模型，可以直接从潜在空间合成逼真的语音，而无需明确的条件？尽管在过去的十年里进行了多次尝试，以对抗和扩散为基础的方法仍然 ......

Disentanglement Unconditional Synthesis 语音 Speech更新时间 2023-08-22

微软的文本转语音服务Microsoft.CognitiveServices.Speech

微软的Edge 浏览器里的大声朗读里-“晓晓” 很接近自然人，比起其它平台的强很多。在AZURE 可免费体验，每月限额50万字，每个语音转换不超过10分钟长度。 C# 调用： using System; using System.Collections.Generic; using System. ......

语音服务 CognitiveServices Microsoft 语音文本更新时间 2023-08-19

C# 微软Speech文字转语音TTS

.net 4.0 以上第一步引用 System.Speech 代码如下 using System;using System.Collections.Generic;using System.Text;using System.IO;using System.Threading;using Spe ......

语音文字 Speech TTS更新时间 2023-08-18

C# 开发微软Speech 语音识别

.net 4.0 以上第一步引用System.Speech 代码如下 using System.Speech.Recognition;using System.Speech.Synthesis; using System.Globalization;using System.IO; privat ......

语音 Speech更新时间 2023-08-18

微软的文本转语音服务Microsoft.CognitiveServices.Speech

微软的Edge 浏览器里的大声朗读里-“晓晓” 很接近自然人，比起其它平台的强很多。在AZURE 可免费体验，每月限额50万字，每个语音转换不超过10分钟长度。 C# 调用： 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 2 ......

语音服务 CognitiveServices Microsoft 语音文本更新时间 2023-08-18

c# system.speech语音识别

在 .net 4.0 添加引用system.speech.dll using System.Speech.Recognition; //创建语音识别引擎 SpeechRecognitionEngine recognitionEngine = new SpeechRecognitionEngine() ......

语音 system speech更新时间 2023-08-18

python: Text-to-Speech and Speech-to-Text

""" python.exe -m pip install --upgrade pip pip install pyttsx3 pip install comtypes pip install Pillow pip install requests pip install PocketSphinx ......

Speech Text Text-to-Speech Speech-to-Text python更新时间 2023-08-05

Text To Speech（文本转语音）

## 项目简介项目中有一部分需要将文本文字进行语音播放，但在网络上查询了很多，发现很多都要注册或者压根就不能用。这时，我考虑自己写一个文本语音播报软件，既可以根据自定义化，还能提高编码水平。 ## 项目实现由于使用**Windows 10**系统，官方语音库肯定是最适配的。库文件包括：`#in ......

语音文本 Speech Text To更新时间 2023-08-02

论文翻译：GESPER: A UNIFIED FRAMEWORK FOR GENERAL SPEECH RESTORATION

摘要本文描述了-腾讯团队提交给ICASSP 2023语音信号改善(SSI)挑战赛的实时通用语音恢复(Gesper)系统。该系统采用两阶段结构，首先进行语音恢复，然后进行语音增强。我们首次提出了一种基于复杂频谱映射的生成对抗网络(CSM-GAN)作为语音恢复模块。针对噪声抑制和去噪，提出了全带宽并行 ......

论文翻译 RESTORATION FRAMEWORK GENERAL UNIFIED更新时间 2023-08-01

论文翻译：SSI-Net: A MULTI-STAGE SPEECH SIGNAL IMPROVEMENT SYSTEM FOR ICASSP 2023

摘要 ICASSP 2023语音信号改善(SSI)挑战赛的重点是提高实时通信(RTC)系统的语音信号质量。本文介绍了提交ICASSP 2023 SSI挑战赛的语音信号改进网络(SSI-Net)，该网络满足实时条件。提出的SSI-Net具有多阶段体系结构。在语音恢复的第一阶段，我们提出了时域恢复生成对 ......

论文翻译 MULTI-STAGE IMPROVEMENT SSI-Net ICASSP更新时间 2023-08-01

speech用法

speech意为言论、口语、说话的方式、能力时，是不可数名词；意为演讲、讲话、台词时，是可数名词，其复数为speeches。发音为：英【spi:t】；美【spi:t】。 speech的用法 1、speech n.演讲、演说、发言，是可数名词，复数是speeches speech on/about s ......

speech更新时间 2023-07-29

Microsoft Speech SDK 5.1 微软的文字转音频 ( 8KHZ 16比特 )

下载安装 Speech SDK 5.1 下载地址： http://www.microsoft.com/en-us/download/details.aspx?id=10121 详细的看这篇 https://www.cnblogs.com/hailexuexi/p/17588586.html C#示例 ......

Microsoft 音频文字 Speech 8KHZ更新时间 2023-07-28

Microsoft Speech SDK 5.1 微软的文字转语音TTS

下载安装 Speech SDK 5.1 1. Windows Speech SDK 5.1版本支持xp系统和server 2003系统，需要下载安装。XP系统默认只带了个Microsoft Sam英文男声语音库，想要中文引擎就需要安装Windows Speech SDK 5.1。下载地址：http: ......

Microsoft 语音文字 Speech 5.1更新时间 2023-07-28

论文翻译（扩散模型来了）：Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data

利用发现的数据来创建合成声音是具有挑战性的，因为现实世界的录音通常包含各种类型的音频退化。解决这个问题的一种方法是使用增强模型对语音进行预增强，然后使用增强后的数据进行文本转语音（TTS）模型训练。本论文研究了使用条件扩散模型进行广义语音增强，旨在同时解决多种类型的音频退化。增强是在对数Mel频谱领 ......

论文翻译 Diffusion-Based Mel-Spectrogram Personalized Enhancement更新时间 2023-07-26

PR语音转字幕转换插件Speech to Text for Premiere Pro

在 Speech to Text for Premiere Pro(PR语音转字幕转换插件中您可以使用以下各种格式转换：中文(PL/PRC)、英文、日语、韩语、意大利语、葡萄牙语、波兰语、法语、意大利语、荷兰语、英语、西班牙语等。如果您对中文、日语、韩语、葡萄牙语、法语、荷兰语等语言感兴趣，可以在这 ......

字幕插件语音 Premiere Speech更新时间 2023-07-24

LHY2022-HW02-Speech Recognition

# 1. 实验结果纪录纪录一下调整参数带来的结果.不过语音识别这块完全不熟. # 1.1 Simple Baseline * **acc>0.45797** 直接上传助教代码 ![image](https://img2023.cnblogs.com/blog/2264614/202306/2264 ......

Recognition Speech 2022 LHY 02更新时间 2023-06-15

.NET使用System.Speech轻松读取文本

System.Speech是.NET框架的一部分，提供了语音识别和语音合成的功能。通过使用System.Speech命名空间中的类，开发人员可以在.NET应用程序中实现语音识别功能。在本文中，我将演示如何使用 System.Speech.NET，这是开发语音应用程序比较牛逼的内库。它适用于 .NE ......

文本 System Speech NET更新时间 2023-06-05

语音识别，语音转文字，会议记录自动化，Meeting Note, Speech to Note

经过百般测试，实践了Python的方案，实现：可以识别英语，但是断句和整句话的整理还是不尽人意。还不如下面这个产品 Speechnotes https://speechnotes.co/dictate/ Pyhton的方案实践记录（部分）： cd /Users/***/opt/anaconda3/ ......

语音会议记录 Note Meeting 文字更新时间 2023-05-31

PR语音转字幕转换插件Speech to Text for Premiere Pro

字幕插件语音 Premiere Speech更新时间 2023-04-24

论文翻译：2023_THLNet: two-stage heterogeneous lightweight network for monaural speech enhancement

论文地址：THLNet: 用于单耳语音增强的两级异构轻量级网络代码：https://github.com/dangf15/THLNet 引用格式：Dang F, Hu Q, Zhang P. THLNet: two-stage heterogeneous lightweight network f ......

论文翻译 heterogeneous enhancement lightweight two-stage更新时间 2023-03-22

口播神器,基于Edge,微软TTS(text-to-speech)文字转语音免费开源库edge-tts实践(Python3.10)

不能否认，微软Azure在TTS(text-to-speech文字转语音)这个人工智能细分领域的影响力是统治级的，一如ChatGPT在NLP领域的随心所欲，予取予求。君不见几乎所有的抖音营销号口播均采用微软的语音合成技术，其影响力由此可见一斑，仅有的白璧微瑕之处就是价格略高，虽然国内也可以使用科大讯 ......

神器 text-to-speech 语音 edge-tts Python3更新时间 2023-03-22

论文翻译：2022_Phase-Aware Deep Speech Enhancement: It's All About The Frame Length

论文地址：相位感知深度语音增强:这完全取决于帧长引用格式：Peer T, Gerkmann T. Phase-aware deep speech enhancement: It's all about the frame length[J]. JASA Express Letters, 2022, ......

论文翻译 Phase-Aware Enhancement Length Speech更新时间 2023-03-22

论文翻译：2022_2022_TEA-PSE 2.0：Sub-Band Network For Real-Time Personalized Speech Enhancement

论文地址：TEA-PSE 2.0：用于实时个性化语音增强的子带网络论文代码：引用：摘要个性化语音增强(Personalized speech enhancement，PSE)利用额外的线索，如说话人embeddings来去除背景噪声和干扰语音，并从目标说话人提取语音。此前，Tencent - ......

论文翻译 2022 Personalized Enhancement Real-Time更新时间 2023-03-22

论文翻译：2022_腾讯DNS 1th TEA-PSE: Tencent-ethereal-audio-lab personalized speech enhancement system for ICASSP 2022 DNS CHALLENGE

论文地址：TEA-PSE: 用于ICASSP 2022 DNS挑战赛的Tencent-ethereal-audio-lab 个性化语音增强系统论文代码：引用格式：Ju Y, Rao W, Yan X, et al. TEA-PSE: Tencent-ethereal-audio-lab pers ......

Tencent-ethereal-audio-lab 论文翻译 2022 personalized enhancement更新时间 2023-03-22

论文翻译：2022_PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement

博客地址：凌逆战 (转载请注明出处) 论文地址：PercepNet+: 用于实时语音增强的相位和信噪比感知 PercepNet 引用格式： Ge X, Han J, Long Y, et al. PercepNet+: A Phase and SNR Aware PercepNet for Real ......

PercepNet 论文翻译 Enhancement Real-Time Speech更新时间 2023-03-22

论文翻译：2022_DNS_1th：Multi-scale temporal frequency convolutional network with axial attention for speech enhancement

论文地址：带轴向注意的多尺度时域频率卷积网络语音增强论文代码：https://github.com/echocatzh/MTFAA-Net 引用：Zhang G, Yu L, Wang C, et al. Multi-scale temporal frequency convolutional n ......

论文翻译 convolutional Multi-scale enhancement frequency更新时间 2023-03-22

共29篇 :1/1页 首页上一页1下一页尾页