（Caffe，LeNet）初始化训练网络（三）

本文从CSDN上转移过来地址：
http://blog.csdn.net/mounty_fsc/article/details/51090306

1. Solver到Net

在SGDSolver的构造函数中详见本系列博文（二），主要执行了其父类Solver的构造函数，接着执行Solver::Init()函数，在Init()中，有两个函数值得注意：InitTrainNet()和InitTestNets()分别初始化训练网络和测试网络。

InitTrainNet
- 首先，ReadNetParamsFromTextFileOrDie(param_.net(), &net_param)把param_.net()（即examples/mnist/lenet_train_test.prototxt）中的信息读入net_param。
- 其次，net_.reset(new Net<Dtype>(net_param))重新构建网络，调用Net的构造方法。
- 然后，在构造方法中执行Net::init()，开始正式创建网络。其主要代码如下：

template <typename Dtype>
void Net<Dtype>::Init(const NetParameter& in_param) {
...
for (int layer_id = 0; layer_id < param.layer_size(); ++layer_id) {

    // Setup layer.
    const LayerParameter& layer_param = param.layer(layer_id);
 
    layers_.push_back(LayerRegistry<Dtype>::CreateLayer(layer_param));

    // Figure out this layer's input and output
    for (int bottom_id = 0; bottom_id < layer_param.bottom_size();  ++bottom_id) {
      const int blob_id = AppendBottom(param, layer_id, bottom_id, &available_blobs, &blob_name_to_idx);
      // If a blob needs backward, this layer should provide it.
      need_backward |= blob_need_backward_[blob_id];
    }
    int num_top = layer_param.top_size();
    for (int top_id = 0; top_id < num_top; ++top_id) {
      AppendTop(param, layer_id, top_id, &available_blobs, &blob_name_to_idx);
    }
 ...

  layers_[layer_id]->SetUp(bottom_vecs_[layer_id], top_vecs_[layer_id]);
  ...
 }

for (int param_id = 0; param_id < num_param_blobs; ++param_id) {
  AppendParam(param, layer_id, param_id);
}

...
}
```

**说明：**

  1. Lenet5在caffe中共有9层，即`param.layer_size() == 5`，以上代码每一次for循环创建一个网络层
  2. 每层网络是通过`LayerRegistry::CreateLayer()`创建的，类似与Solver的创建（详见本系列博文（二））
  3. 14行`Net::AppendBottom()`，对于`layer_id`这层，从`Net::blob_`中取出blob放入该层对应的`bottom_vecs_[layer_id]`中
  4. 20行`Net::AppendTop()`，对于`layer_id`这层，创建`blob`（未包含数据）并放入`Net::blob_`中
  5. `Layer::SetUp()`
  6. `AppendParam`中把每层网络的训练参数与网络变量`learnable_params_`绑定，在lenet中，只有`conv1`,`conv2`,`ip1`,`ip2`四层有参数，每层分别有参数与偏置参数两项参数，因而`learnable_params_`的size为8.

InitTestNets
<font color="red">该部分内容见本系列博文：（Caffe，Lenet5）初始化测试网络（四）。</font>

2 训练网络结构

序	Layer	layer Type Bottom	Blob Top	Blob Top	Blob Shape
1	minst	Data		data&&label	64 1 28 28 (50176) && 64 (64)
2	conv1	Convolution	data	conv1	64 20 24 24 (737280)
3	pool1	Pooling	conv1	pool1	64 20 12 12 (184320)
4	conv2	Convolution	pool1	conv2	64 50 8 8 (204800)
5	pool2	Pooling	conv2	pool2	64 50 4 4 (51200)
6	ip1	InnerProduct	pool2	ip1	64 500 (32000)
7	relu1	ReLU	ip1	ip1(in-place)	64 500 (32000)
8	ip2	InnerProduct	ip1	ip2	64 10 (640)
9	loss	SoftmaxWithLoss	ip2&&label	loss	(1)

注：Top Blob Shape格式为：BatchSize，ChannelSize，Height，Width（Total Count）

3 第一层：Data Layer

3.1 protobuff定义

训练网络的第一层protobuff定义为：

layer {
  name: "mnist"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TRAIN
  }
  transform_param {
    scale: 0.00390625
  }
  data_param {
    source: "examples/mnist/mnist_train_lmdb"
    batch_size: 64
    backend: LMDB
  }
}

3.2 函数LayerRegistry::CreateLayer

第1节中代码第一次通过调用LayerRegistry::CreateLayer()创建了DataLayer类，DataLayer类的继承关系如下图所示，详见[1]：

由继承图可知，调用DataLayer()的构造函数，依次执行的顺序为其基类构造函数：Layer()、BaseDataLayer()、InternalThread()（详见（Caffe）基本类InternalThread（三））、BasePrefetchingDataLayer()、及DataLayer()。

其中，值得注意的是DataLayer()，在调用基类构造函数BasePrefetchingDataLayer()之后，对 DataReader reader_ 进行赋值，在该DataLayer对象中维护了一个DataReader对象reader_，其作用是添加读取数据任务至，一个专门读取数据库（examples/mnist/mnist_train_lmdb）的线程（若还不存在该线程，则创建该线程），此处一共取出了4*64个样本至BlockingQueue<Datum*> DataReader::QueuePair::full_。详见（Caffe）基本类DataReader、QueuePair、Body（四）

template <typename Dtype>
DataLayer<Dtype>::DataLayer(const LayerParameter& param)
  : BasePrefetchingDataLayer<Dtype>(param),
    reader_(param) {
}

3.3 函数Layer::SetUp

此处按程序执行顺序值得关注的有：
在DataLayer::DataLayerSetUp中根据3.2DataReader中介绍的读取的数据中取出一个样本推测blob的形状

BasePrefetchingDataLayer::LayerSetUp如下代码prefetch_[i].data_.mutable_cpu_data()用到了涉及到gpu、cpu间复制数据的问题，见（Caffe）基本类Blob，Layer，Net（一）1.4SyncedMemory及引用[2]

 // Before starting the prefetch thread, we make cpu_data and gpu_data
 // calls so that the prefetch thread does not accidentally make simultaneous
 // cudaMalloc calls when the main thread is running. In some GPUs this
 // seems to cause failures if we do not so.
 for (int i = 0; i < PREFETCH_COUNT; ++i) {
   prefetch_[i].data_.mutable_cpu_data();
   if (this->output_labels_) {
     prefetch_[i].label_.mutable_cpu_data();
   }
 }

BasePrefetchingDataLayer类继承了InternalThread，BasePrefetchingDataLayer<Dtype>::LayerSetUp中通过调用StartInternalThread()开启了一个新线程，从而执行BasePrefetchingDataLayer::InternalThreadEntry
BasePrefetchingDataLayer::InternalThreadEntry关键代码如下，其中load_batch(batch)为，从2.2介绍的BlockingQueue<Datum*> DataReader::QueuePair::full_（包含从数据库读出的数据）中读取一个batch_size的数据到BlockingQueue<Batch<Dtype>*> BasePrefetchingDataLayer::prefetch_full_中。由于该线程在prefetch_free_为空时将挂起等待（PREFETCH_COUNT=3），prefetch_full_中用完的Batch将放回prefetch_free_中。<u>该线程何时停止？</u>
```
    while (!must_stop()) {
      Batch<Dtype>* batch = prefetch_free_.pop();
      load_batch(batch);
#ifndef CPU_ONLY
      if (Caffe::mode() == Caffe::GPU) {
        batch->data_.data().get()->async_gpu_push(stream);
        CUDA_CHECK(cudaStreamSynchronize(stream));
      }
#endif
      prefetch_full_.push(batch);
    }
```

关于线程的总结：

此外一共涉及到两个线程，分别为都是继承了InnerThread的BasePrefetchingDataLayer(DataLayer)类和DataReader中的Body类
Body为面向数据库的线程，不断从某个数据库中读出数据，存放至缓存为队列DataReader::QueuePair::BlockingQueue<Datum*>，一般保存4*64个单位数据，单位为Datum
BasePrefetchingDataLayer为面向网络的线程，从Body的缓存中不断读取数据。BasePrefetchingDataLayer的缓存为队列BlockingQueue<Batch*>，一般存放3个单位的数据，单位为Batch

static const int PREFETCH_COUNT = 3;
Batch<Dtype> prefetch_[PREFETCH_COUNT];
BlockingQueue<Batch<Dtype>*> prefetch_free_;
BlockingQueue<Batch<Dtype>*> prefetch_full_;

template <typename Dtype>
BasePrefetchingDataLayer<Dtype>::BasePrefetchingDataLayer(
    const LayerParameter& param)
    : BaseDataLayer<Dtype>(param),
      prefetch_free_(), prefetch_full_() {
  for (int i = 0; i < PREFETCH_COUNT; ++i) {
    prefetch_free_.push(&prefetch_[i]);
  }
}

prefetch_full_与prefetch_free_中的元素由prefetch_提供

4 第二层：Convolution Layer

4.1 protobuff定义

layer {
  name: "conv1"
  type: "Convolution"
  bottom: "data"
  top: "conv1"
  param {
    lr_mult: 1
  }
  param {
    lr_mult: 2
  }
  convolution_param {
    num_output: 20
    kernel_size: 5
    stride: 1
    weight_filler {
      type: "xavier"
    }
    bias_filler {
      type: "constant"
    }
  }
}

4.2 函数LayerRegistry::CreateLayer

这里写图片描述

说明：

不像DataLayer 直接执行的是构造函数，此时执行的是GetConvolutuionLayer()，然后调用ConvolutionLayer()，原因如下：

REGISTER_LAYER_CREATOR(Convolution, GetConvolutionLayer);

4.3 Layer::SetUp

在Layer::SetUp中，调用了ConvolutionLayer的基类BaseConvolutionLayer的LayerSetUp及Reshape函数，该类的主要成员变量如下：

/**
 * @brief Abstract base class that factors out the BLAS code common to
 *        ConvolutionLayer and DeconvolutionLayer.
 */
template <typename Dtype>
class BaseConvolutionLayer : public Layer<Dtype> {
 public:
  explicit BaseConvolutionLayer(const LayerParameter& param)
      : Layer<Dtype>(param) {}
  virtual void LayerSetUp(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top);
  virtual void Reshape(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top);

 ...
  /// @brief The spatial dimensions of a filter kernel.
  Blob<int> kernel_shape_;
  /// @brief The spatial dimensions of the stride.
  Blob<int> stride_;
  /// @brief The spatial dimensions of the padding.
  Blob<int> pad_;
  /// @brief The spatial dimensions of the dilation.
  Blob<int> dilation_;
  /// @brief The spatial dimensions of the convolution input.
  Blob<int> conv_input_shape_;
  /// @brief The spatial dimensions of the col_buffer.
  vector<int> col_buffer_shape_;
  /// @brief The spatial dimensions of the output.
  vector<int> output_shape_;
  const vector<int>* bottom_shape_;
...
};

说明：

LayerSetUp函数中，主要是初始化了kernel_shape_、stride_、pad_、dilation_以及初始化网络参数，并存放与Layer::blobs_中。
Reshape函数中，conv_input_shape_、bottom_shape_等

5 第三层：Pooling Layer

5.1 protobuff定义

layer {
  name: "pool1"
  type: "Pooling"
  bottom: "conv1"
  top: "pool1"
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}

5.2 Layer::SetUp

通过调用虚函数LayerSetUp及Reshape对以下成员变量进行初始化

/**
 * @brief Pools the input image by taking the max, average, etc. within regions.
 *
 * TODO(dox): thorough documentation for Forward, Backward, and proto params.
 */
template <typename Dtype>
class PoolingLayer : public Layer<Dtype> {
 ....
  int kernel_h_, kernel_w_;
  int stride_h_, stride_w_;
  int pad_h_, pad_w_;
  int channels_;
  int height_, width_;
  int pooled_height_, pooled_width_;
  bool global_pooling_;
  Blob<Dtype> rand_idx_;
  Blob<int> max_idx_;
};

6 第四层、第五层

基本同第二层、第三层

7 第六层：InnerProduct Layer

7.1 protobuff定义

layer {
  name: "ip1"
  type: "InnerProduct"
  bottom: "pool2"
  top: "ip1"
  param {
    lr_mult: 1
  }
  param {
    lr_mult: 2
  }
  inner_product_param {
    num_output: 500
    weight_filler {
      type: "xavier"
    }
    bias_filler {
      type: "constant"
    }
  }
}

7.2 Layer::SetUp

/**
 * @brief Also known as a "fully-connected" layer, computes an inner product
 *        with a set of learned weights, and (optionally) adds biases.
 *
 * TODO(dox): thorough documentation for Forward, Backward, and proto params.
 */
template <typename Dtype>
class InnerProductLayer : public Layer<Dtype> {
 ...
  int M_;
  int K_;
  int N_;
  bool bias_term_;
  Blob<Dtype> bias_multiplier_;
};

说明：

N_为输出大小，即等于protobuff中定义的num_output
K_为输入大小，对于该层Bottom Blob形状为(N, C, H, W)，N为batch_size，K_=CHW（Caffe）基本类Blob，Layer，Net（一），M_=N。其中只有C、H、W跟内积相关

8 第七层：ReLU Layer

8.1 protobuff定义

layer {
  name: "relu1"
  type: "ReLU"
  bottom: "ip1"
  top: "ip1"
}

8.2 说明

ReLULayer主要是用来做计算的，其继承关系如下，详细参加[4]、[5]

9 第八层：InnerProduct Layer

参见第7节

10 第九层：SoftmaxWithLoss Layer

10.1 protobuff定义

layer {
  name: "loss"
  type: "SoftmaxWithLoss"
  bottom: "ip2"
  bottom: "label"
  top: "loss"
}

10.2 LayerRegistry::CreateLayer

这里写图片描述

10.3 Layer::SetUp

值得注意的是：

类SoftmaxWithLossLayer包含类SoftmaxLayer的实例
shared_ptr<Layer<Dtype> > softmax_layer_
softmax_layer_在LayerSetUp中赋值。
此函数内调用Layer::SetLossWeights初始化了该层的Top Blob（loss）
两个类间的关系如下图：

这里写图片描述
成员变量prob_作为Softmaxlayer的top blob
bottom blob[0]作为softmaxlayer的bottom blob
所以经过softmaxlayer计算之后，得出64*10（每个样本的每个类别上的概率）存放在prob_中

11 剩余的工作

至此，训练网络基本创建完毕，接下来剩下的工作主要有：

反向检查一次网络，看哪些blobs会对loss产生影响，在LeNet5中，前面的9层均有影响
初始化权值共享

[1].http://caffe.berkeleyvision.org/doxygen/classcaffe_1_1BasePrefetchingDataLayer.html
[2].http://caffe.berkeleyvision.org/tutorial/net_layer_blob.html Implementation Details
[3].http://caffe.berkeleyvision.org/doxygen/classcaffe_1_1ConvolutionLayer.html
[4].http://caffe.berkeleyvision.org/doxygen/classcaffe_1_1ReLULayer.html
[5].http://caffe.berkeleyvision.org/tutorial/layers.html ReLU / Rectified-Linear and Leaky-ReLU

最后编辑于：2017.12.03 03:53:38

人面猴
序言：七十年代末，一起剥皮案震惊了整个滨河市，随后出现的几起案子，更是在滨河造成了极大的恐慌，老刑警刘岩，带你破解...
沈念sama阅读 204,684评论 6赞 478
死咒
序言：滨河连续发生了三起死亡事件，死亡现场离奇诡异，居然都是意外死亡，警方通过查阅死者的电脑和手机，发现死者居然都...
沈念sama阅读 87,143评论 2赞 381
救了他两次的神仙让他今天三更去死
文/潘晓璐我一进店门，熙熙楼的掌柜王于贵愁眉苦脸地迎上来，“玉大人，你说我怎么就摊上这事。” “怎么了？”我有些...
开封第一讲书人阅读 151,214评论 0赞 337
道士缉凶录：失踪的卖姜人
文/不坏的土叔我叫张陵，是天一观的道长。经常有香客问我，道长，这世上最难降的妖魔是什么？我笑而不...
开封第一讲书人阅读 54,788评论 1赞 277
港岛之恋（遗憾婚礼）
正文为了忘掉前任，我火速办了婚礼，结果婚礼上，老公的妹妹穿的比我还像新娘。我一直安慰自己，他们只是感情好，可当我...
茶点故事阅读 63,796评论 5赞 368
恶毒庶女顶嫁案：这布局不是一般人想出来的
文/花漫我一把揭开白布。她就那样静静地躺着，像睡着了一般。火红的嫁衣衬着肌肤如雪。梳的纹丝不乱的头发上，一...
开封第一讲书人阅读 48,665评论 1赞 281
城市分裂传说
那天，我揣着相机与录音，去河边找鬼。笑死，一个胖子当着我的面吹牛，可吹牛的内容都是我干的。我是一名探鬼主播，决...
沈念sama阅读 38,027评论 3赞 399
双鸳鸯连环套：你想象不到人心有多黑
文/苍兰香墨我猛地睁开眼，长吁一口气：“原来是场噩梦啊……” “哼！你这毒妇竟也来了？” 一声冷哼从身侧响起，我...
开封第一讲书人阅读 36,679评论 0赞 258
万荣杀人案实录
序言：老挝万荣一对情侣失踪，失踪者是张志新（化名）和其女友刘颖，没想到半个月后，有当地人在树林里发现了一具尸体，经...
沈念sama阅读 41,346评论 1赞 299
护林员之死
正文独居荒郊野岭守林人离奇死亡，尸身上长有42处带血的脓包…… 初始之章·张勋以下内容为张勋视角年9月15日...
茶点故事阅读 35,664评论 2赞 321
白月光启示录
正文我和宋清朗相恋三年，在试婚纱的时候发现自己被绿了。大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
茶点故事阅读 37,766评论 1赞 331
活死人
序言：一个原本活蹦乱跳的男人离奇死亡，死状恐怖，灵堂内的尸体忽然破棺而出，到底是诈尸还是另有隐情，我是刑警宁泽，带...
沈念sama阅读 33,412评论 4赞 321
日本核电站爆炸内幕
正文年R本政府宣布，位于F岛的核电站，受9级特大地震影响，放射性物质发生泄漏。R本人自食恶果不足惜，却给世界环境...
茶点故事阅读 39,015评论 3赞 307
男人毒药：我在死后第九天来索命
文/蒙蒙一、第九天我趴在偏房一处隐蔽的房顶上张望。院中可真热闹，春花似锦、人声如沸。这庄子的主人今日做“春日...
开封第一讲书人阅读 29,974评论 0赞 19
一桩弑父案，背后竟有这般阴谋
文/苍兰香墨我抬头看了看天上的太阳。三九已至，却和暖如春，着一层夹袄步出监牢的瞬间，已是汗流浃背。一阵脚步声响...
开封第一讲书人阅读 31,203评论 1赞 260
情欲美人皮
我被黑心中介骗来泰国打工，没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留，地道东北人。一个月前我还...
沈念sama阅读 45,073评论 2赞 350
代替公主和亲
正文我出身青楼，却偏偏与公主长得像，于是被迫代替她去往敌国和亲。传闻我的和亲对象是个残疾皇子，可洞房花烛夜当晚...
茶点故事阅读 42,501评论 2赞 343

（Caffe，LeNet）初始化训练网络（三）

1. Solver到Net

2 训练网络结构

3 第一层：Data Layer

3.1 protobuff定义

3.2 函数LayerRegistry::CreateLayer

3.3 函数Layer::SetUp

4 第二层：Convolution Layer

4.1 protobuff定义

4.2 函数LayerRegistry::CreateLayer

4.3 Layer::SetUp

5 第三层：Pooling Layer

5.1 protobuff定义

5.2 Layer::SetUp

6 第四层、第五层

7 第六层：InnerProduct Layer

7.1 protobuff定义

7.2 Layer::SetUp

8 第七层：ReLU Layer

8.1 protobuff定义

8.2 说明

9 第八层：InnerProduct Layer

10 第九层：SoftmaxWithLoss Layer

10.1 protobuff定义

10.2 LayerRegistry::CreateLayer

10.3 Layer::SetUp

11 剩余的工作

推荐阅读更多精彩内容