张博208-CSDN博客

原创 Multiprocessing Event Object In Python

Event

2024-04-17 10:55:13 679

原创 Multiprocessing Freeze Support in Python

You can add support for multiprocessing when freezing your code via the multiprocessing.freeze_support() function.In this tutorial you will discover how to add freeze support for multiprocessing in your Python program.You can package your code with a Pytho

2024-04-17 10:38:29 1020

原创 Understand torch.scatter

scatter 理解

2024-04-10 16:16:03 625

转载 NVIDIA MPS总结

MPS多进程服务（Multi-Process Scheduling）是CUDA应用程序编程接口（API）的替代二进制兼容实现。从Kepler的GP10架构开始，NVIDIA就引入了MPS（基于软件的多进程服务），这种技术在当时实际上是称为HyperQ ，允许多个流（stream）或者CPU的进程同时向GPU发射Kernel函数，结合为一个单一应用程序的上下文在GPU上运行，从而实现更好的GPU利用率。在单个进程的任务处理，对GPU利用率不高的情况下是非常有用的。

2024-02-06 10:02:34 127

原创 CUDA Thread Indexing

CUDA Thread Indexing。

2023-07-10 16:45:58 387

转载 Facebook如何训练超大模型系列

我们在前文介绍过，微软 ZeRO 可以对一个万亿参数模型可以使用 8 路模型并行、64 路管道并行和 8 路数据并行在 4,096 个 NVIDIA A100 GPU 上进行扩展。而FSDP（Fully Sharded Data Parallel）是Facebook 深度借鉴微软ZeRO之后提出的PyTorch DDP升级版本，可以认为是对标微软 ZeRO，其本质是 parameter sharding。Parameter sharding 就是把模型参数等切分到各个GPU之上。

2023-06-21 15:31:24 317

原创大模型训练技术论文

大模型训练框架论文

2023-06-21 11:23:43 1071

转载大模型训练框架Megatron原理系列转载

NVIDIA Megatron 是一个基于 PyTorch 的分布式训练框架，用来训练超大Transformer语言模型，其通过综合应用了数据并行，Tensor并行和Pipeline并行来复现 GPT3，值得我们深入分析其背后机理。本系列大概有6～7篇文章，通过论文和源码和大家一起学习研究。本文把 Megatron 的两篇论文/一篇官方PPT 选取部分内容，糅合在一起进行翻译分析，希望大家可以通过本文对 Megatron 思路有一个基本了解。

2023-06-20 11:34:03 2098

转载人脸识别损失函数(Center-Loss、A-Softmax、AM-Softmax、ArcFace)

引用

2023-05-30 18:22:10 245

原创 linkname, soname and realname

As you may already know, the linker1 option should be used to specify the name of the library which is needed by your application when it was compiled. For example, if we write a program which utilize libjpeg, we should specify explicitly when we compile

2023-05-22 13:26:11 559

原创 Linux资料

学习

2023-05-06 15:32:29 60

原创通过例子深入了解c++/c的构建系统

C/C++ is the mother of many popular programming languages out there today, all the fancy programming languages we use today like Python, JavaScript are built using C/C++. For example, the standard python interpreter is built using C and the most popular J

2023-04-25 13:37:04 1690 4

原创 PixelShuffle上采样原理

PixelShuffle是目前通用上采样upsample技术中性能最好的。PixelShuffle现已广泛应用在如图像分割等计算机视觉问题上，和。一起成为了神经网络中最常用的两种上采样技术。

2023-04-12 11:36:45 127

原创 @contextmanager 和yield 的结合使用法

上面这个例子，我们可以把print('1') 和 print('3')看成是try和except，要执行代码的部分就是他们中间的部分,进行以下更改。先执行commit(),遇到yield后，跳出来执行sql1,sql2，然后再进入commit()执行上一次跳出来的位置。所以要引用装饰器@contextmanager，它需要与with语法结合使用。但是每次都这样try...except...就会显得很重复。

2023-04-06 17:45:34 267

原创 Understanding Diffusion Probabilistic Models (DPMs) 扩散模型，最好资料

diffusion model step by step

2023-03-28 17:59:32 801 1

原创扩散模型学习

diffusion model 学习资料呢

2023-03-27 16:24:50 65

原创 RLHF知识

然而，对生成结果的评估是主观和依赖上下文的，这些结果难以用现有的基于规则的文本生成指标 (如 BLUE 和 ROUGE) 来衡量。除了评估指标，现有的模型通常以预测下一个单词的方式和简单的损失函数 (如交叉熵) 来建模，没有显式地引入人的偏好和主观意见。因此，训练阶段，如果直接用人的偏好（或者说人的反馈）来对模型整体的输出结果计算reward或loss，显然是要比上面传统的“给定上下文，预测下一个词”的损失函数合理的多。：即使用强化学习的方法，利用人类反馈信号直接优化语言模型。

2023-03-24 11:39:07 1056

转载 HRNet网络简介

转载

2023-03-23 18:30:22 66

原创 diffusion models

diffusion models

2023-03-17 11:33:32 67

原创 Swin-Transformer网络结构详解

swin-transformer

2023-03-14 16:24:48 64

原创 Linux 编译链接动态库版本号

无论程序是否按libxxx.so.a.b.c格式命名，但Linux上差点儿全部动态库在编译时都指定了-soname，我们能够通过readelf工具查看soname，比方文章开头列举的两个动态库。这时候发现，生成了新的链接libhello.so.1，而main程序还是使用的libhello.so.0，所以无法使用新版动态库的功能，须要又一次编译才行。执行可执行文件时保证libxxx.so.a.b.c文件存在，通过ldconfig生成libxxx.so.a链接指向libxxx.so.a.b.c。

2023-01-31 11:32:44 842

原创 Changing input size of pre-trained models in Keras

https://ckyrkou.medium.com/changing-input-size-of-pre-trained-models-in-keras-3dfbe3ca3091

2023-01-19 16:06:34 127

原创多卡训练时设置batch_size和learning_rate

Linear Scale Rule:

2023-01-06 10:50:04 771

原创计算机视觉神经网络基础理论

基础问题

2022-12-09 14:56:35 1149

原创转置卷积中的棋盘效应

转置卷积中的棋盘效应_CtrlZ1的博客-CSDN博客_棋盘效应的原因

2022-11-28 11:19:58 166

原创 Python中import的用法

python 中的相对引用绝对引用

2022-11-21 11:46:23 823

原创 Neural Turing Machines-NTM系列

NTM是一种使用Neural Network为基础来实现传统图灵机的理论计算模型。利用该模型，可以通过训练的方式让系统“学会”具有时序关联的任务流。论文：http://arxiv.org/abs/1410.5401中文翻译：http://www.dengfanxin.cn/?p=60ppt:http://llcao.net/cu-deeplearning15/presentati

2022-11-03 09:30:45 300

原创 TensorRT概述

深度学习模型研发的生命周期包括五步：目标确认、任务建模、数据采集与标注、模型训练、模型部署。作为炼丹师，接触得最多的是前面四步，但是模型部署也是非常重要的一环，它是模型落地前的临门一脚。

2022-11-02 13:40:19 173

原创 Python Machine Learning Cookbook

import numpy as npfrom sklearn import preprocessingdata = np.array([[ 3, -1.5, 2, -5.4], [ 0, 4, -0.3, 2.1], [ 1, 3.3, -1.9, -4.3]])# mean removaldata_standardized ...

2022-11-02 09:39:13 161

原创 spark能传递外部命名参数给main函数吗？

查了资料好像都没有办法。只能通过： def main(args: Array[String]): Unit = { // 读取参数 var city = args(0) var input = args(1) var date = args(2)下标来获取。不过不确定，去stackoverflow发帖问下。https://stackoverflow.com/que...

2022-11-02 09:36:55 134

原创 Huber Loss function

Huber loss是为了增强平方误差损失函数（squared loss function）对噪声（或叫离群点，outliers）的鲁棒性提出的。DefinitionLδ(a)={12a2,δ⋅(|a|−12δ),for|a|≤δ,otherwise.Lδ(a)={12a2,for|a|≤δ,δ⋅(|a|−12δ),otherwise.参数aa通常表示residuals...

2022-11-02 09:35:34 102

转载 AI编译优化--总纲

随着AI模型结构的快速演化，底层计算硬件的层出不穷，用户使用习惯的推陈出新，单纯基于手工优化来解决AI模型的性能和效率问题越来越容易出现瓶颈。为了应对这些问题，AI编译优化技术已经成为一个获得广泛关注的技术方向。这两年来，这个领域也异常地活跃，包括老牌一些的、、、，以及最近呼声很高的，能够看到不同的公司、社区在这个领域进行着大量的探索和推进。本篇会先做一下整体介绍以及行业现状的分析。

2022-09-02 17:46:05 680

转载经典论文阅读笔记——VIT、Swin Transformer、MAE、CILP

学习

2022-09-01 21:59:27 69

原创 prompt learning 学习资料

学习资料

2022-09-01 18:34:46 240

原创多模态学习

多模态学习资料

2022-09-01 18:01:27 610

原创神经网络算子融合

算子融合技术

2022-09-01 16:01:13 798

转载 Pytorch自动混合精度(AMP)训练

Pytorch自动混合精度(AMP)训练_ytusdc的博客-CSDN博客_pytorch 混合精度训练

2022-09-01 08:32:32 61

原创如何在深度学习中使用自动混合精度训练

DL中的混合精度训练是同时使用单精度(32位)和半精度(16位)表示的过程。这是通过模型在FP32中为需要精度的计算保留一些训练来实现的，而在FP16中对精度不那么重要的其他计算进行训练。回归DL的基本训练原理，复习基础技术，可以缓解神经网络训练阶段的压力，优化GPU的使用。在本指南中，我们将更深入地探讨混合精度训练和自动混合精度训练，以及它如何保持神经网络训练阶段的准确性，同时减少训练所花费的时间。这里没有太多需要调整的地方，但这将是你的代码中支持混合精度训练的部分，所以要特别小心并仔细检查你的工作。..

2022-08-31 21:38:42 709

原创大规模模型训练tricks集锦

大规模模型训练其实就是在和计算、存储和通信玩的过程，所以我列一下跟这些相关的文章。一. 大规模模型并行策略先来介绍一下几种经典的并行范式，以及他们对应的经典文章。

2022-08-31 21:08:20 870 1

原创千亿参数“一口闷”？大模型训练必备四种策略

AI领域的许多最新进展都围绕大规模神经网络展开，但训练大规模神经网络是一项艰巨的工程和研究挑战，需要协调GPU集群来执行单个同步计算。随着集群数和模型规模的增长，机器学习从业者开发了多项技术，在多个GPU上进行并行模型训练。乍一看，这些并行技术令人生畏，但只需对计算结构进行一些假设，这些技术就会变得清晰——在这一点上，就像数据包在网络交换机之间传递一样，那也只是从A到B传递并不透明的位（bits）。三层模型中的并行策略。每种颜色代表一层，虚线分隔不同的 GPU。训练神经网络是一个迭代的过程。...

2022-08-31 21:06:33 540

GPU-知识点资料合集

bank_conflicts coalescing

2023-08-03

Pro Go The Complete Guide -go语言学习最新书籍

Best-selling author Adam Freeman explains how to get the most from Go, starting from the basics and building up to the most advanced and sophisticated features. You will learn how Go builds on a simple and consistent type system to create a comprehensive and productive development experience that produces fast and robust applications that run across platforms 参见：https://www.amazon.com/Pro-Go-Complete-Programming-Efficient/dp/1484273540/ref=sr_1_1?crid=1K22H21ZB1EIZ&keywords=Pro+Go+The+Complete+G

2023-06-19

扩散模型讲义美国大学之一

2023-03-28

Advanced_Programming_in_the_UNIX_Environment，_3rd

Advanced_Programming_in_the_UNIX_Environment，_3rd_Edition very good book for unix user

2018-11-30

Pattern_Recognition_and_Big_Data

Pattern_Recognition_and_Big_Data 很好的资源，对于学习大数据的朋友来说

2018-09-07

图论引导中文

中文版本图论引导

2018-09-05

现代图论--------------

现代图论研究生教材适合大家学习与总结了

2018-09-05

Deep_Learning_Quick_Reference

Deep_Learning_Quick_Reference, a cookbook for deep learning

2018-09-01

Convex Optimization Algorithms

Convex Optimization Algorithms, understand convex optimization algorithms, this is good chances

2018-09-01

Guide.to.Medical.Image.Analysis.Methods.and.Algorithms

Guide.to.Medical.Image.Analysis.Methods.and.Algorithms very good book for computer vision

2018-09-01

machine learning algorithm

machine learning algorithm 想学习的可以好好学学了

2018-04-02

Hands-On Data Science and Python Machine Learning py

2018-03-27

Python Machine Learning Machine Learning and Deep Learning

Python Machine Learning Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow, 2nd Edition 很受推荐

2018-03-27

Data Structures and Algorithms Using Python and C++

Data Structures and Algorithms Using Python and C++ 数据结构与算法方面的书籍

2018-03-27

R_for_Data_Science

R_for_Data_Science_－_Import，_Tidy，_Transform，_Visualize_and_Model_Data.rar

2018-03-27

深度学习之Pytorch

国内少有的学习 pytorch的资料,适合初学者, 希望对大家有帮助,清晰版本

2018-03-27

Deep Learning with PyTorch pdf版本

Pdf 版本, 方便阅读而且操作, 如果需要代码,请到如下地址

2018-03-27

Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow

Table of Contents Giving Computers the Ability to Learn from Data Training Simple Machine Learning Algorithms for Classification A Tour of Machine Learning Classifiers Using Scikit-Learn Building Good Training Sets - Data Preprocessing Compressing Data via Dimensionality Reduction Learning Best Practices for Model Evaluation and Hyperparameter Tuning Combining Different Models for Ensemble Learning Applying Machine Learning to Sentiment Analysis Embedding a Machine Learning Model into a Web Application Predicting Continuous Target Variables with Regression Analysis Working with Unlabeled Data - Clustering Analysis Implementing a Multilayer Artificial Neural Network from Scratch Parallelizing Neural Network Training with TensorFlow Going Deeper - The Mechanics of TensorFlow Classifying Images with Deep Convolutional Neural Networks Modeling Sequential Data using Recurrent Neural Networks

2018-03-17

空空如也

TA创建的收藏夹 TA关注的收藏夹

TA关注的人

GPU-知识点资料合集

Pro Go The Complete Guide -go语言学习最新书籍

扩散模型讲义 美国大学之一

Advanced_Programming_in_the_UNIX_Environment，_3rd

Pattern_Recognition_and_Big_Data

图论引导 中文

现代图论--------------

Deep_Learning_Quick_Reference

Convex Optimization Algorithms

Guide.to.Medical.Image.Analysis.Methods.and.Algorithms

machine learning algorithm

Hands-On Data Science and Python Machine Learning py

Python Machine Learning Machine Learning and Deep Learning

Data Structures and Algorithms Using Python and C++

R_for_Data_Science

深度学习之Pytorch

Deep Learning with PyTorch pdf版本

Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow

Approximate.Dynamic.Programming.2011

计算群体智能基础

Swarm Intelligence Principles Advances and Applications

Neural_Network_Methods_in_Natural_Language_Processing

Reinforcement Learning With Open A TensorFlow and Keras Using Python.pdf

Fundamentals of Deep Learning完整非扫描版本2017

Data Wrangling with R

NLTK基础教程-用NLTK和Python库构建机器学习应用2017-06

Text Mining in Practice with R 2017.12

Text_Mining－From_Ontology_Learning_to_Automated_Text_Processing_Applications

Python Natural Language Processing最新版本

Mastering Scipy

Elegant SciPy

Tensorflow 机器学习参考手册2007

reinforcement learning An Introduction 第二版

Spark大数据处理技术 带标签 完整版

TensorFlow技术解析与实战 高清晰完整版- 2017新书

TENSORFLOW深度学习

模式分类11

集体编程智慧

敏捷软件开发：原则、模式与实践

面向对象方法原理与实践

空空如也

扩散模型讲义美国大学之一

图论引导中文

Spark大数据处理技术带标签完整版

TensorFlow技术解析与实战高清晰完整版- 2017新书