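Original 20190510 Speech recognition resource roundup

Recommended speech processing course | Speech Processing (2019), National Taiwan Normal University, taught by Professor Berlin Chen (陈柏琳). http://berlin.csie.ntnu.edu.tw/Courses/Speech Processing/Speech Processing_Main_2019S.htm Professor Chen has taught for many years, and his homepage lists many other courses. http://berlin.csi...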

2020-04-13 18:39:37 5482 4

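Translation A survey of attention models (with translation): An Attentive Survey of Attention Models

This is an arXiv paper from April 2019. I spent a day translating it, though for a beginner like me it turned out not to be of much use. Download link: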

2019-04-17 17:18:48 564

Original Changing sql_mode when installing tpshop on Windows 10

The sql_mode change required when installing tpshop on Windows 10

2023-02-03 11:46:55 244

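Original No sound after installing Windows 10 on a MacBook

No sound after installing Windows 10 on a MacBook. Mine is a 2015 model. Find the sound icon in the lower-right corner, right-click it, choose Sounds, and open the Playback tab. My Playback tab looked like this: each entry (Speakers, Digital Audio, and so on) can be right-clicked and tested; whichever one produces sound in the test, set it as the default device. My default was Digital Audio, which produced no sound in the test, while Speakers did; after setting Speakers as the default, the problem was solved....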

2021-03-11 21:01:30 8744

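Original OPPO A57 does not respond after connecting to a computer

Today I wanted to back up the photos on my mother's phone, but the OPPO A57 did nothing after being connected to the computer, and the phone's storage did not show up. I searched all over and tried various drivers without success. Only later did I discover that OPPO phones ship with an app called File Manager (文件管理), which has a Remote Management feature: once the service is turned on, the computer can see the phone's files as long as both are on the same network. When I first got the phone I deleted a whole batch of OPPO's preinstalled apps; luckily this one was not among them....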

2020-10-14 14:06:30 1284

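Original Library not loaded: @loader_path/libmex.dylib

For the past two days I have been trying to run an asvspoof2017 baseline written in MATLAB, but a dynamic library failed to load. After a long struggle, and help from an expert, I finally solved it. I forgot to take a screenshot of my own error; it said the mexmaci64 file was invalid, much like the following. The problem: Library not loaded: @loader_path/libmex.dylib Referenced from: /Users/usr/Documents/MATLAB/SFMedu2/denseMatch/priority_queue_1.0/pq_create.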

2020-06-25 11:52:30 1555 1

Original 20200621--learning-to-fool-the-speaker-recognition-master experiment log

Error 1: RuntimeError: Detected that PyTorch and torchvision were compiled with different CUDA versions. PyTorch has CUDA Version=10.2 and torchvision has CUDA Version=10.1. Please reinstall the torchvision that matches your PyTorch install. Fix: pip install t

2020-06-22 23:56:50 574

Repost The simplest PyTorch example

# From the Bilibili instructor Liu Er Da Ren (刘二大人)
import torch
x_data = torch.Tensor([[1.0], [2.0], [3.0]])
y_data = torch.Tensor([[2.0], [4.0], [6.0]])
class LinearModel(torch.nn.Module):
    def __init__(self):
        super(LinearModel, self).__init__()
        self.linear = torch.nn.Line

2020-06-21 16:00:17 7039

Translation icassp2020---XMU-TS SYSTEMS FOR NIST SRE19 CTS CHALLENGE

XMU-TS SYSTEMS FOR NIST SRE19 CTS CHALLENGE. Hao Lu1, Jianfeng Zhou2, Miao Zhao1, Wendian Lei3, Qingyang Hong∗1, Lin Li∗2. 1 School of Informatics, Xiamen University, China; 2 School of Electronic...

2020-04-22 16:21:12 1726

Translation icassp2020--TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCES

TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCES. Kai Liu, Huan Zhou. Artificial Intelligence Application Research Center, Huawei Technologies, Shenzhen, PRC. ABSTRACT...

2020-04-22 16:18:37 1395

Translation ICASSP 2020 conference schedule

https://cmsworkshops.com/ICASSP2020/TechnicalProgram.asp Mainly a way to see which topics ICASSP covers; on the original page, each topic links to its list of papers.

2020-04-13 15:18:42 2532 4

Translation Some ICASSP 2020 keynote talks

https://cmsworkshops.com/ICASSP2020/TechnicalProgram.asp T-1: Machine Learning and Wireless Communications. Mobile communications and machine learning are two of the most exciting and rapidly...

2020-04-13 15:01:45 10516

Original PyAudio installation error on Mac

pip install pyaudio fails, complaining that portaudio.h is missing. Fix: pip install --global-option='build_ext' --global-option='-I/usr/local/include' --global-option='-L/usr/local/lib' pyaudio Reference: https://www.jianshu.com/p/7f81e...

2020-04-01 14:49:42 697 1

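Original How to copy only the directory structure on Linux, without the data

find . -type d -exec mkdir -p /data/datasets/musan1/{} \; This finds every entry of type d (that is, every directory) under the current directory and runs the command that follows on each one. The current directory is the folder you want to copy, and the path after -p is the destination folder...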

2020-03-27 15:07:02 4340
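
For readers who prefer a script over the find command in the entry above, the same idea can be sketched in Python. This is only an illustrative sketch, not part of the original post: the function name copy_dir_tree and its parameters are hypothetical, and it assumes the destination lies outside the source tree.

```python
import os


def copy_dir_tree(src_root, dst_root):
    """Recreate the directory tree of src_root under dst_root, copying no files.

    Hypothetical helper for illustration; assumes dst_root is not inside src_root.
    Returns the list of destination directories that were ensured to exist.
    """
    created = []
    for current, _dirs, _files in os.walk(src_root):
        # Path of the visited directory relative to the source root ("." for the root itself).
        rel = os.path.relpath(current, src_root)
        target = os.path.normpath(os.path.join(dst_root, rel))
        # Like `mkdir -p`: create missing parents and do not fail if it already exists.
        os.makedirs(target, exist_ok=True)
        created.append(target)
    return created
```

Calling copy_dir_tree(".", "/data/datasets/musan1") from inside the folder to be mirrored would correspond to the command quoted in the post; files are never opened or copied, only directories are created.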

Translation 2016--AN EXTENSIBLE SPEAKER IDENTIFICATION SIDEKIT IN PYTHON

AN EXTENSIBLE SPEAKER IDENTIFICATION SIDEKIT IN PYTHON. Anthony Larcher1, Kong Aik Lee2, Sylvain Meignier1. 1 LIUM - Université du Maine, France; 2 Human Language Technology Department, Insti...

2020-03-13 19:00:34 995

Translation 2016--MatConvNet Convolutional Neural Networks for MATLAB

Abstract: MatConvNet is an implementation of Convolutional Neural Networks (CNNs) for MATLAB. The toolbox is designed with an emphasis on simplicity and flexibility. It exposes the building blocks o...

2020-03-13 18:59:21 933

Translation AN OPEN-SOURCE SPEAKER GENDER DETECTION FRAMEWORK FOR MONITORING GENDER EQUALITY

AN OPEN-SOURCE SPEAKER GENDER DETECTION FRAMEWORK FOR MONITORING GENDER EQUALITY. David Doukhan, Jean Carrive. French National Institute of Audiovisual, Paris, France. Félicien Vallet...

2020-03-13 18:58:18 715

Translation S4D: Speaker Diarization Toolkit in Python

S4D: Speaker Diarization Toolkit in Python. 1 French National Audiovisual Institute (INA), Paris, France; 2 Computer Science Laboratory of Le Mans University (LIUM - EA 4023), Le Mans, France. SIDEKIT for ...

2020-03-13 18:57:28 883

Translation 2017--Speaker and Language Recognition and Characterization: Introduction to the CSL Special Issue

2017–Speaker and Language Recognition and Characterization: Introduction to the CSL Special Issue. Eduardo Lleida1, Luis Javier Rodriguez-Fuentes2. 1 Aragon Institute for Engineering Research (I3A), Uni...

2020-03-13 18:56:37 2890

Translation 2019---Introduction to the special issue “Speaker and language characterization and recog

Introduction to the special issue “Speaker and language characterization and recognition: Voice modeling, conversion, synthesis and ethical aspects”. Welcome to this ...

2020-03-13 18:56:00 579

Translation An initial investigation on optimizing tandem speaker verification and countermeasure systems using

An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning. Authors: Anssi Kanervisto, Junichi Yamagishi. Link: https...

2020-02-18 22:30:32 558

Translation Emotion Recognition Using Speaker Cues

Emotion Recognition Using Speaker Cues. Author: Ismail Shahin. Link: https://arxiv.org/abs/2002.03566 This research aims at identifying the unknown emotion using speaker cues. In this study, we...

2020-02-18 22:29:54 182

Translation Unsupervised training of neural mask-based beamforming

Unsupervised training of neural mask-based beamforming. Lukas Drude*, Jahn Heymann*, Reinhold Haeb-Umbach. Paderborn University, Department of Communications Engineering, Paderborn, Germany. {drude, he...

2020-02-12 23:22:01 254

Translation 2016--Analysis of the DNN-based SRE systems in multi-language conditions

This paper analyzes the behavior of our state-of-the-art Deep Neural Network/i-vector/PLDA-based speaker recognition systems in multi-language conditions. On the “Language Pack” of the PRISM set, we e...

2020-02-12 23:20:51 426

Translation The LeVoice Far-field Speech Recognition System for VOiCES from a Distance Challenge 2019

The LeVoice Far-field Speech Recognition System for VOiCES from a Distance Challenge 2019. Yulong Liang, Lin Yang, Xuyang Wang, Yingjie Li, Chen Jia, Junjie Wang. Lenovo. The focus is on...

2020-02-12 22:54:55 457

Translation Far-Field End-to-End Text-Dependent Speaker Verification based on Mixed Training Data with Transfer

Far-Field End-to-End Text-Dependent Speaker Verification based on Mixed Training Data with Transfer Learning and Enrollment Data Augmentation. Xiaoyi Qin1,2, Danwei Cai1, Ming Li1. 1 Data Science Resea...

2020-02-12 22:51:33 607

Translation 2019--Target Speaker Extraction for Multi-Talker Speaker Verification

Target Speaker Extraction for Multi-Talker Speaker Verification. Wei Rao1, Chenglin Xu2,3, Eng Siong Chng2,3, Haizhou Li1. 1 Department of Electrical and Computer Engineering, National University of S...

2020-02-12 22:48:22 1743 2

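Original Canon iP1180 inkjet printer driver for Mac

The download is here: http://www.downcc.com/soft/30779.html I am thrilled that the old printer at home finally works. On Windows the driver installs automatically and the printer just works, but the only Windows machine at home has become unbearably slow, so getting it to work on the Mac is wonderful....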

2020-02-07 12:57:14 988

Repost The evolution of softmax-based losses

https://mp.weixin.qq.com/s/mdBzDaK9pQ7Be4SAId8oQQ

2020-02-05 23:59:51 294

Translation Within-sample variability-invariant loss for robust speaker recognition under noisy environments

Within-sample variability-invariant loss for robust speaker recognition under noisy environments. Authors: Danwei Cai, Ming Li. Note: Accepted at ICASSP 2020. Link: https://arxiv.org/abs...

2020-02-04 18:15:51 806

Translation Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression

Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression. Authors: Aparna Khare, Minhua Wu. Link: https://arxiv.org/abs/2002.00122 Recent literature has shown that a learned...

2020-02-04 16:35:11 184

Translation DropClass and DropAdapt: Dropping classes for deep speaker representation learning

DropClass and DropAdapt: Dropping classes for deep speaker representation learning. Authors: Chau Luu, Steve Renals. Note: Submitted to Speaker Odyssey 2020. Link: https://arx...

2020-02-04 14:12:18 641

Translation Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks

Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks. Authors: Jingdong Li, Changliang Li. Link: https://arxiv.org/abs/2002.00319 Jingdong Li∗ Hui Zha...

2020-02-04 13:27:43 733

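Original How to replace line breaks with spaces in Word for Mac

In Word for Windows this is easy: find every ^p and replace it with a space. In Word for Mac, however, searching for ^p finds nothing. After many attempts I finally found a fix. Word for Mac shows paragraph marks by default, and this has to be turned off first in Word's Preferences: under View, in the "Show non-printing characters" section, there is an "All" option that is checked by default; uncheck it. Then search for ^p again; it is now found and can be replaced with a space....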

2020-02-04 13:19:45 5409

Translation Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification

Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification. Yingke Zhu1, Tom Ko2, David Snyder3, Brian Mak1, Daniel Povey3. 1 Department of Computer Science & Engineering, The Hong Ko...

2020-02-04 01:13:33 1948 1

Translation SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition. Daniel S. Park∗, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, Ekin D. Cubuk, Quoc V. Le. Google Brain. {danielsp...

2020-02-01 23:55:48 4020

Original Points to watch when exploring speaker recognition datasets

Note: In the speaker id community the words “train”, “test” and “development” are used in a different sense from in the speech recognition community. In speaker-id land, the “development” data is the...

2019-12-20 17:09:11 368

Translation 2018--Analysis of Length Normalization in End-to-End Speaker Verification System

Weicheng Cai2, Jinkun Chen2, Ming Li1. 1 Data Science Research Center, Duke Kunshan University, Kunshan, China; 2 School of Electronics and Information Technology, Sun Yat-sen University, Guangzhou, China...

2019-12-20 16:33:21 376

Translation 2019-utterance-level end-to-end language identification using attention-based cnn-blstm--icassp 2019

Weicheng Cai1,2, Danwei Cai1, Shen Huang3 and Ming Li1∗. 1 Data Science Research Center, Duke Kunshan University, Kunshan, China; 2 School of Electronics and Information Technology, Sun Yat-sen University, ...

2019-12-20 16:09:41 330

Translation 2019-SPEAKER RECOGNITION FOR MULTI-SPEAKER CONVERSATIONS USING X-VECTORS

SPEAKER RECOGNITION FOR MULTI-SPEAKER CONVERSATIONS USING X-VECTORS. David Snyder, Daniel Garcia-Romero, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev Khudanpur. Center for Language and Speech Proce...

2019-12-20 12:35:37 496

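Kaldi introduction PPT by speech recognition expert Dan Povey.rar

The attachment is a PPT in which speech recognition expert Dan Povey introduces Kaldi. It is somewhat dated, but the content is all fundamental and a must-read for Kaldi beginners; it covers the usual data formats in Kaldi and the general speech recognition pipeline.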

2019-10-30

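An Attentive Survey of Attention Models: a survey of attention models (with translation)

This is an arXiv paper from April 2019. I spent a day translating it, though for a beginner like me it turned out not to be of much use. The translation is all in the annotations. How do I lower the download points? 5 points seems too high, but I cannot find where to change it.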

2019-04-17

All papers by Yun Wang (王赟)

These are all of his papers as of April 2019, with titles and abstracts translated. To learn more about him, see: https://blog.csdn.net/yj13811596648/article/details/89359337

2019-04-17

A Neural Attention Model for Abstractive Sentence Summarization

This is the subtitle file for a video; for usage, see: https://blog.csdn.net/yj13811596648/article/details/89354314

2019-04-17

Speaker recognition based on Gaussian mixture models

For usage instructions, see: https://blog.csdn.net/yj13811596648/article/details/88746350

2019-03-22

Speaker recognition dataset--Spoken Speaker Identification based on Gaussian Mixture Models-2

This is part 2. For usage instructions, see: https://blog.csdn.net/yj13811596648/article/details/88746350

2019-03-22

Speaker recognition dataset--Spoken Speaker Identification based on Gaussian Mixture Models-1

For usage instructions, see: https://blog.csdn.net/yj13811596648/article/details/88746350

2019-03-22

Voice gender detection using GMMs

This is my own translated version; the original is here: https://blog.csdn.net/yj13811596648/article/details/88737623

2019-03-22

Speech recognition dataset-speech analytic--gender recognition--Voice Gender Detection using GMMs-2

Usage instructions are here: https://blog.csdn.net/yj13811596648/article/details/88737623

2019-03-22

Speech recognition dataset-speech analytic--gender recognition--Voice Gender Detection using GMMs-1

Usage instructions are here: https://blog.csdn.net/yj13811596648/article/details/88737623

2019-03-22

Neo4j: The Definitive Guide - Graph Databases - A New Tool for the Big Data Era.pdf

Neo4j: The Definitive Guide - Graph Databases - A New Tool for the Big Data Era.pdf

2019-01-03

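Huang Yong - Zhiliao Ketang (知了课堂) - Flask lessons 40-50: source code and database files

The instructor's videos are here: https://study.163.com/course/courseMain.htm?courseId=1004091002 I have tested the code myself and it works.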

2018-12-11

Fresh blue-and-pink style PPT template

A very pretty PPT template.

2018-12-05

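2017-IEEE database seminar slides

Do you really know how to use the search features of the IEEE database? Take a look: these slides were prepared by an IEEE product trainer, and you will learn something from them.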

2018-11-30

Navicat, a MySQL database viewer for Mac

Tested and working: Navicat, a MySQL database viewer for Mac.

2018-11-19

kNN algorithm--compiled by Graceyan

A PPT compiled from the following video: https://study.163.com/course/courseMain.htm?courseId=1005709005

2018-10-21

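Handwriting recognition training set: digits 0 to 9 and uppercase and lowercase English letters

A handwriting recognition training set covering the digits 0 to 9 and the uppercase and lowercase English letters, with 55 images per character, for a total of 55*(10+26+26)=3410 PNG images. Usage example: https://blog.csdn.net/yj13811596648/article/details/83241708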

2018-10-21

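Communication University of China 2016 graduate entrance exam: computer data structures and networks

Communication University of China 2016 graduate entrance exam paper on computer data structures and networks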

2018-10-14
