自定义博客皮肤VIP专享

*博客头图:

格式为PNG、JPG,宽度*高度大于1920*100像素,不超过2MB,主视觉建议放在右侧,请参照线上博客头图

请上传大于1920*100像素的图片!

博客底图:

图片格式为PNG、JPG,不超过1MB,可上下左右平铺至整个背景

栏目图:

图片格式为PNG、JPG,图片宽度*高度为300*38像素,不超过0.5MB

主标题颜色:

RGB颜色,例如:#AFAFAF

Hover:

RGB颜色,例如:#AFAFAF

副标题颜色:

RGB颜色,例如:#AFAFAF

自定义博客皮肤

-+

tiankonghewo的专栏

好好学习,天天向上

  • 博客(10)
  • 资源 (24)
  • 收藏
  • 关注

原创 git上传本地文件到gitHub

亲测有效git远程仓库已经建好了,本地文件已经存在了,现在要将本地代码推到git远程仓库保存参考文章https://www.cnblogs.com/zhangsanfeng/p/10163968.htmlhttps://blog.csdn.net/qq_34446663/article/details/80468752git push -u origin master -f 强制pu...

2019-09-20 13:35:51 127

原创 scala疑惑(一) ListSet添加元素

object Test extends Logging { def main(args: Array[String]): Unit = { val a=scala.collection.immutable.ListSet(21,100,23) val b=a+4 b.foreach(println) }}这里的a+4调用了scala.collecti...

2019-09-12 19:18:11 939

原创 估算the JVM heap中object占用内存大小

org.apache.spark.util.collection.SizeTracker#takeSamplespark在shuffle的read和write阶段,都涉及到采样估算集合占用内存大小/** * Take a new sample of the current collection's size. */ private def takeSample(): Unit...

2019-09-11 16:36:42 254

原创 spark sql 自定义udf函数

import org.apache.spark.sql.functions._def compare(value_missing: String, value: String): Boolean = { var flag = false if (value_missing.length == value.length) { flag = value_missing....

2019-08-29 16:58:46 471

原创 spark源码剖析(二,ShuffleReader)

版本信息spark version 2.3.3jdk 1.8idea 2019MacBook Prospark的shuffle过程连接了job的前后两个stage除了第一个stage的数据是读取hdfs,hbase,hive等等之外其他的stage的数据都要利用ShuffleReader抓取数据ShuffleReaderShuffleReader是一个trait, 从注释看,...

2019-08-28 23:22:27 159

原创 case class的序-----Ordering和Ordered

版本信息scala 2.11.8jdk 1.8idea 2019MacBook ProOrdering在scala里要自定义一个类的话,一般都是case class,例如case class Student(name:String, score:Int)如果我们有了一个Student的数组val students = Array(Student("bob", 80), St...

2019-08-27 17:05:12 196

原创 spark中shuffle算子汇总

版本信息spark version 2.3.3jdk 1.8idea 2019MacBook Pro我们先在idea中搜素一下ShuffleDependency可以看到,生成的依赖是ShuffleDependency的RDD有CoGroupedRDDShuffledRDDSubtractedRDD然后我们分别看下什么算子产生了这些RDDShuffledRDD我们...

2019-08-26 16:34:37 774

原创 spark源码剖析(一,job调用流程)

最近领导让做一次关于spark的分享,于是专门把spark的流程看了一边,做一下记录,也是为了练练markdown,仅此而已。版本信息spark version 2.3.3jdk 1.8idea 2019MacBook Pro##从RDD开始在spark中,一个action算子触发真正的计算,我们看下RDD上的count/** * Return the number o...

2019-08-25 21:44:31 221

原创 文档模板Latex

\documentclass[UTF8,10pt,a4paper]{article}\usepackage{ctex}\usepackage{amsmath}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{graphicx}\usepackage{bm} \usepackage{pdfpages}\author{wsy}\

2017-07-15 21:26:13 451

原创 报告模板LaTex

% !TeX spellcheck = en_GB% WangSheying于2015/11/2整理,TJU北洋园校区% TeXLive2015+TeXstudio个人推荐,可在线升级usepackage,比较方便%***************************************************************************************

2017-07-15 21:25:14 1556

Data Structures and Algorithms with Scala.pdf

Data Structures and Algorithms with Scala A Practitioners Approach with Emphasis on Functional Programming.pdf 高清版本,非扫描

2019-07-17

jdk1.8.0_171.jdk.zip

苹果MacBook安装的jdk8压缩包,自己上官网太麻烦,苹果MacBook安装的jdk8压缩包,自己上官网太麻烦

2019-07-17

Graph_Databases_2e_Neo4j.mobi

Graph Databases, published by O’Reilly Media, discusses the problems that are well aligned with graph databases, with examples drawn from practical, real-world use cases. This book also looks at the ecosystem of complementary technologies, highlighting what differentiates graph databases from other database technologies, both relational and NOSQL.

2019-07-17

Graph_Databases_2e_Neo4j.epub

Graph Databases, published by O’Reilly Media, discusses the problems that are well aligned with graph databases, with examples drawn from practical, real-world use cases. This book also looks at the ecosystem of complementary technologies, highlighting what differentiates graph databases from other database technologies, both relational and NOSQL.

2019-07-17

Graph_Algorithms_Neo4j.mobi

Graph Algorithms Practical Examples in Apache Spark & Neo4j,高清版kindle版

2019-07-17

hadoop完全分布式高可用配置文件

hadoop完全分布式高可用配置文件,还有spark和zookeeper的配置,后续可以自己更改,很方便的,............................................................................................................................................................................................................

2017-09-01

hadoop-2.8.1完全分布式搭建脚本和配置文件

3台zookeeper,实现namenode和resourcemanager的高可用,脚本实现12台机器ssh免密登陆的全部自动化,还有配置文件的分发也全部是脚本实现,这样就很省事了

2017-08-24

hadoop免密登陆脚本

一个脚本命令实现免密登陆配置

2017-08-21

Hadoop- The Definitive Guide, 4th

Hadoop- The Definitive Guide, 4th

2017-08-13

进程和Vim讲解

进程和Vim

2017-08-13

Linux入门与基础

Linux入门与基础

2017-08-13

MapReduce-algorithms

MapReduce-algorithms

2017-08-12

the elements of statistical learning

the elements of statistical learning

2017-08-12

Machine Learning-A Probabilistic Perspective

Machine Learning

2017-08-12

hadoop权威指南天气测试案例和执行脚本

hadoop权威指南天气测试案例和执行脚本

2017-08-09

linux课程资料

linux课程资料

2017-08-02

Purely Functional Data Structures.pdf

Purely Functional Data Structures

2017-07-28

Programming in Scala 3rd Edition

Programming in Scala 3rd Edition ,英文原版

2017-07-12

Learning from data

Learning from data

2016-06-11

Kernel Methods for Pattern Analysis

Kernel Methods for Pattern Analysis,原文件网址http://cs.du.edu/~mitchell/mario_books/Kernel_Methods_for_Pattern_Analysis_-_John_Shawe-Taylor_&_Nello_Christianini.pdf

2016-06-05

空空如也

TA创建的收藏夹 TA关注的收藏夹

TA关注的人

提示
确定要删除当前文章?
取消 删除