Grace_yanyanyan-CSDN博客

原创 20190510 语音识别资源整理

语音处理课程推荐|Speech Processing（2019）台师大Speech Processing。国立台湾师范大学的陈柏琳教授。http://berlin.csie.ntnu.edu.tw/Courses/Speech Processing/Speech Processing_Main_2019S.htm陈教授教学多年，主页上还有好多其他课程。http://berlin.csi...

2020-04-13 18:39:37 5482 4

翻译 Attention模型的调查报告（有翻译）An Attentive Survey of Attention Models

这是arxiv上19年4月的论文，花了一天时间翻译了一下，但是感觉对于我这样的小白来说，然并卵。下载地址：

2019-04-17 17:18:48 564

原创 win10安装tpshop遇到的修改sql_mode的问题

win10安装tpshop遇到的修改sql_mode的问题

2023-02-03 11:46:55 244

原创 macbook安装win10没有声音

macbook安装win10没有声音我的macbook是2015年款找到右下角声音图标，右键，选择声音，选择播放选项卡，我的播放选项卡是这个样子：扬声器、数字音频之类都可以点击右键—测试，测试哪个有声音就给它设置成默认设备就好了。我的一开始默认的是数字音频，但是测试没有声音，扬声器测试的时候就有声音，设成默认以后就好了。...

2021-03-11 21:01:30 8744

原创 oppoA57 连上电脑之后没反应

今天要给妈妈的手机做照片备份，结果oppoA57 连上电脑之后没反应，看不见手机的内存卡，找了一大圈办法，弄各种驱动都没有解决。后来才发现oppo手机自带了一个app叫文件管理，里面有个远程管理，打开服务之后，只要电脑和手机在一个网络下，电脑就可以看见手机里的资料了。买手机之后第一次用的时候删掉了一大堆oppo自带的软件，幸亏这个app没有删。...

2020-10-14 14:06:30 1284

原创 Library not loaded: @loader_path/libmex.dylib

这两天要跑一个asvspoof2017的baseline，matlab的代码，可是出现一个动态库无法加载的问题，搞了好久还请了高人帮忙，终于解决了我自己的问题忘了截图了，说mexmaci64这个文件无效，跟下面差不多：问题如下：Library not loaded: @loader_path/libmex.dylibReferenced from:/Users/usr/Documents/MATLAB/SFMedu2/denseMatch/priority_queue_1.0/pq_create.

2020-06-25 11:52:30 1555 1

原创 20200621--learning-to-fool-the-speaker-recognition-master 实验记录

出错1：RuntimeError: Detected that PyTorch and torchvision were compiled with different CUDA versions. PyTorch has CUDA Version=10.2 and torchvision has CUDA Version=10.1. Please reinstall the torchvision that matches your PyTorch install.解决办法：pip install t

2020-06-22 23:56:50 574

转载 pytorch 最简单示例

# 来自B站刘二大人import torchx_data = torch.Tensor([[1.0], [2.0], [3.0]])y_data = torch.Tensor([[2.0], [4.0], [6.0]])class LinearModel(torch.nn.Module): def __init__(self): super(LinearModel, self).__init__() self.linear = torch.nn.Line

2020-06-21 16:00:17 7039

翻译 icassp2020---XMU-TS SYSTEMS FOR NIST SRE19 CTS CHALLENGE

XMU-TS SYSTEMS FOR NIST SRE19 CTS CHALLENGEHao Lu1, Jianfeng Zhou2, Miao Zhao1, Wendian Lei3, Qingyang Hong∗1, Lin Li∗21School of Informatics, Xiamen University, China，厦门大学信息学院2School of Electronic...

2020-04-22 16:21:12 1726

翻译 icassp2020--TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCES

TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCESKai Liu, Huan ZhouArtificial Intelligence Application Research Center, Huawei Technologies Shenzhen, PRCABSTRACT摘...

2020-04-22 16:18:37 1395

翻译 icassp2020会议时间安排

https://cmsworkshops.com/ICASSP2020/TechnicalProgram.asp主要是看看来了解下icassp的topic都有啥，原链接每个topic点进去还有paper列表。

2020-04-13 15:18:42 2532 4

翻译 ICASSP2020一些主题演讲

https://cmsworkshops.com/ICASSP2020/TechnicalProgram.asp@[TOC] 目录T-1: Machine Learning and Wireless Communicationsmobile communications and machine learning are two of the most exciting and rapidly...

2020-04-13 15:01:45 10516

原创 mac 安装pyaudio报错

pip install pyaudio报错说缺少：portaudio.h解决办法：pip install --global-option=‘build_ext’ --global-option=’-I/usr/local/include’ --global-option=’-L/usr/local/lib’ pyaudio参考：https://www.jianshu.com/p/7f81e...

2020-04-01 14:49:42 697 1

原创 linux如何只复制目录结构而不复制数据

find . -type d -exec mkdir -p /data/datasets/musan1/{} ;在当前目录下找类型为d的文件（即目录类型），然后执行后面的操作。当前目录是你要copy的文件夹，-p后面接的目的文件夹...

2020-03-27 15:07:02 4340

翻译 2016--AN EXTENSIBLE SPEAKER IDENTIFICATION SIDEKIT IN PYTHON

AN EXTENSIBLE SPEAKER IDENTIFICATION SIDEKIT IN PYTHONAnthony Larcher1, Kong Aik Lee2, Sylvain Meignier11LIUM - Universite ́ du Maine, France 法国勒芒大学2Human Language Technology Department, Insti...

2020-03-13 19:00:34 995

翻译 2016--MatConvNet Convolutional Neural Networks for MATLAB

Abstract摘要MatConvNet is an implementation of Convolutional Neural Networks (CNNs) for MATLAB. The toolbox is designed with an emphasis on simplicity and flexibility. It exposes the building blocks o...

2020-03-13 18:59:21 933

翻译 AN OPEN-SOURCE SPEAKER GENDER DETECTION FRAMEWORK FOR MONITORING GENDER EQUALITY

AN OPEN-SOURCE SPEAKER GENDER DETECTION FRAMEWORK FOR MONITORING GENDER EQUALITY监测两性平等的开源说话人性别检测框架David Doukhan, Jean CarriveFrench National Institute of Audiovisual Paris, FranceFe ́licien Vallet...

2020-03-13 18:58:18 715

翻译 S4D: Speaker Diarization Toolkit in Python

S4D: Speaker Diarization Toolkit in Python1French National Audiovisual Institute (INA), Paris, France2Computer Science Laboratory of Le Mans University (LIUM - EA 4023), Le Mans, FranceSIDEKIT for ...

2020-03-13 18:57:28 883

翻译 2017--Speaker and Language Recognition and Characterization: Introduction to the CSL Special Issue

2017–Speaker and Language Recognition and Characterization: Introduction to the CSL Special IssueEduardo Lleida1, Luis Javier Rodriguez-Fuentes21 Aragon Institute for Engineering Research (I3A), Uni...

2020-03-13 18:56:37 2890

翻译 2019---Introduction to the special issue “Speaker and language characterization and recog

Introduction to the special issue “Speaker and language characterization and recognition: Voice modeling, conversion, synthesis and ethical aspects”“说话人和语言的特征和识别：声音建模、转换、合成和伦理方面”专题介绍Welcome to this ...

2020-03-13 18:56:00 579

翻译 An initial investigation on optimizing tandem speaker verification and countermeasure systems using

An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning标题：利用强化学习优化串联说话人验证与对抗系统的初步研究作者： Anssi Kanervisto, Junichi Yamagishi链接：https...

2020-02-18 22:30:32 558

翻译 Emotion Recognition Using Speaker Cues

Emotion Recognition Using Speaker Cues标题：基于说话人线索的情感识别作者： Ismail Shahin链接：https://arxiv.org/abs/2002.03566This research aims at identifying the unknown emotion using speaker cues. In this study, we...

2020-02-18 22:29:54 182

翻译 Unsupervised training of neural mask-based beamforming

Unsupervised training of neural mask-based beamformingLukas Drude?, Jahn Heymann?, Reinhold Haeb-UmbachPaderborn University, Department of Communications Engineering, Paderborn, Germany{drude, he...

2020-02-12 23:22:01 254

翻译 2016--Analysis of the DNN-based SRE systems in multi-language conditions

This paper analyzes the behavior of our state-of-the-art Deep Neural Network/i-vector/PLDA-based speaker recognition systems in multi-language conditions. On the “Language Pack” of the PRISM set, we e...

2020-02-12 23:20:51 426

翻译 The LeVoice Far-field Speech Recognition System for VOiCES from a Distance Challenge 2019

The LeVoice Far-field Speech Recognition System for VOiCES from a Distance Challenge 2019Yulong Liang, Lin Yang, Xuyang Wang, Yingjie Li, Chen Jia, Junjie WangLenovo [email protected]重点在...

2020-02-12 22:54:55 457

翻译 Far-Field End-to-End Text-Dependent Speaker Verification based on Mixed Training Data with Transfer

Far-Field End-to-End Text-Dependent Speaker Verification based on Mixed Training Data with Transfer Learning and Enrollment Data AugmentationXiaoyi Qin1,2, Danwei Cai1, Ming Li11Data Science Resea...

2020-02-12 22:51:33 607

翻译 2019--Target Speaker Extraction for Multi-Talker Speaker Verification

Target Speaker Extraction for Multi-Talker Speaker VerificationWei Rao1, Chenglin Xu2,3, Eng Siong Chng2,3, Haizhou Li11Department of Electrical and Computer Engineering, National University of S...

2020-02-12 22:48:22 1743 2

原创 canon ip 1180 喷墨打印机 mac 驱动

下载地址就在这里：http://www.downcc.com/soft/30779.html好激动啊，家里这个老式打印机终于能用了，太开心了。这个打印机用win的话是自动装驱动的，直接就能用。可怜家里唯一的win已经慢的不行了，mac能用就太好啦，开心开心~~~~...

2020-02-07 12:57:14 988

转载 Softmax-based Loss的演化史

https://mp.weixin.qq.com/s/mdBzDaK9pQ7Be4SAId8oQQ

2020-02-05 23:59:51 294

翻译 Within-sample variability-invariant loss for robust speaker recognition under noisy environments

Within-sample variability-invariant loss for robust speaker recognition under noisy environments标题：样本内变异性-噪声环境下稳健说话人识别的不变损失作者： Danwei Cai, Ming Li备注：Accepted at ICASSP 2020链接：https://arxiv.org/abs...

2020-02-04 18:15:51 806

翻译 Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression

Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression标题：使用混合比特率opus压缩的多声道声学建模作者： Aparna Khare, Minhua Wu链接：https://arxiv.org/abs/2002.00122Recent literature has shown that a learned...

2020-02-04 16:35:11 184

翻译 DropClass and DropAdapt: Dropping classes for deep speaker representation learning

DropClass and DropAdapt: Dropping classes for deep speaker representation learning标题：DropClass和DropAdapt：用于深层说话人表示学习的丢弃类作者： Chau Luu, Steve Renals备注：Submitted to Speaker Odyssey 2020链接：https://arx...

2020-02-04 14:12:18 641

翻译 Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks

Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks标题：基于时域卷积递归神经网络的单通道语音增强作者： Jingdong Li, Changliang Li链接：https://arxiv.org/abs/2002.00319Jingdong Li∗ Hui Zha...

2020-02-04 13:27:43 733

原创如何替换mac word中的换行符为空格

要是win版的word，直接替换就很方便，找到所有^p，然后替换为空格即可。但是mac版的word，直接找^p，根本找不到。试了好多次，终于发现解决办法了。mac版的word默认是显示段落标记的，需要先在word的偏好设置中，把显示段落标记的地方勾掉，在视图里，显示非打印字符，下面有个全部，把全部前面的默认的√，勾掉即可。回来再找^p，就找到了，替换为空格即可。...

2020-02-04 13:19:45 5409

翻译 Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification

Self-Attentive Speaker Embeddings for Text-Independent Speaker VerificationYingke Zhu1, Tom Ko2, David Snyder3, Brian Mak1, Daniel Povey31Department of Computer Science & EngineeringThe Hong Ko...

2020-02-04 01:13:33 1948 1

翻译 SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

SpecAugment: A Simple Data Augmentation Method for Automatic Speech RecognitionDaniel S. Park∗, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, Ekin D. Cubuk, Quoc V. LeGoogle Brain{danielsp...

2020-02-01 23:55:48 4020

原创探索说话人识别数据集时要注意的问题

Note:In the speaker id community the words “train”, “test” and “development”are used in a different sense from in the speech recognition community. Inspeaker-id land, the “development” data is the...

2019-12-20 17:09:11 368

翻译 2018--Analysis of Length Normalization in End-to-End Speaker Verification System

Weicheng Cai2, Jinkun Chen2, Ming Li11Data Science Research Center, Duke Kunshan University, Kunshan, China2School of Electronics and Information Technology, Sun Yat-sen University, Guangzhou, China...

2019-12-20 16:33:21 376

翻译 2019-utterance-level end-to-end language identification using attention-based cnn-blstm--icassp 2019

Weicheng Cai1,2,Danwei Cai1, Shen Huang3and Ming Li1∗1Data Science Research Center, Duke Kunshan University, Kunshan, China2School of Electronics and Information Technology, Sun Yat-sen University, ...

2019-12-20 16:09:41 330

翻译 2019-SPEAKER RECOGNITION FOR MULTI-SPEAKER CONVERSATIONS USING X-VECTORS

SPEAKER RECOGNITION FOR MULTI-SPEAKER CONVERSATIONS USING X-VECTORSDavid Snyder , Daniel Garcia-Romero, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev KhudanpurCenter for Language and Speech Proce...

2019-12-20 12:35:37 496