Split Learning: Distributed and collaborative learning

Graphical representation of a split learning model with three database icons forming a triangle and lines connecting them each to the one to each side of themselves. There is one small graphical snail travelling across each line between the databases.
Credit: https://arxiv.org/pdf/1812.00564.pdf

Abstract: Can a server utilize deep learning models for training or inference without accessing raw data from clients? Split learning naturally allows for various configurations of cooperating entities to train (and infer from) machine learning models without sharing any raw data or detailed information about the model.

Key Idea: In the simplest of configurations of split learning, each client (for example, radiology center) trains a partial deep network up to a specific layer known as the cut layer. The outputs at the cut layer are sent to another entity (server/another client) which completes the rest of the training without looking at raw data from any client that holds the raw data. This completes a round of forward propagation without sharing raw data. The gradients are now back propagated again from its last layer until the cut layer in a similar fashion. The gradients at the cut layer (and only these gradients) are sent back to radiology client centers. The rest of back propagation is now completed at the radiology client centers. This process is continued until the distributed split learning network is trained without looking at each others raw data.

Split learning’s computational and communication efficiency on clients:

Client-side communication costs are significantly reduced as the data to be transmitted is restricted to initial layers of the split learning network (splitNN) prior to the split. The client-side computation costs of learning the weights of the network are also significantly reduced for the same reason. In terms of model performance, the accuracies of Split NN remained competitive to other distributed deep learning methods like federated learning and large batch synchronous SGD with a drastically smaller client side computational burden when training on a larger number of clients as shown below in terms of teraflops of computation and gigabytes of communication when split learning is used to train Resnet and VGG architectures over 100 and 500 clients with CIFAR 10 and CIFAR 100 datasets.

Versatile plug-and-play configurations of split learning

Versatile configurations of split learning configurations cater to various practical settings of i) multiple entities holding different modalities of patient data, ii) centralized and local health entities collaborating on multiple tasks, iii) learning without sharing labels, iv) multi-task split learning, v) multi-hop split learning and other hybrid possibilities to name a few as shown below and further detailed in our paper here (PDF).

RESEARCH AREAS
line drawing of a computer microchip - a box with four lines coming out of each side and a smaller box in the center with the bottom right corner angled

Computation

line drawing of a head in profile with a microchip overlaying the brain area of the head

Machine Learning

IMPACT AREAS
line drawing of a stack of computer servers connected to each other

Big Data

line drawing of three intersecting circles, one with a dollar symbol, one with a euro symbol, one with a bitcoin symbol

Finance

line drawing of a heart with a one pulse line in the middle of it

Healthcare / Medicine

line drawing of a train trestle bridge with three columns underneath and an arched span across the top

Infrastructure

line drawing of a wrench

Manufacturing

line drawing of three people figures (one top center, one bottom left, and one bottom right) with a circle in the center between them and lines connecting each figure to the center circle

Social Networks

RESEARCH AREAS
line drawing of a computer microchip - a box with four lines coming out of each side and a smaller box in the center with the bottom right corner angled

Computation

line drawing of a head in profile with a microchip overlaying the brain area of the head

Machine Learning

IMPACT AREAS
line drawing of a stack of computer servers connected to each other

Big Data

line drawing of three intersecting circles, one with a dollar symbol, one with a euro symbol, one with a bitcoin symbol

Finance

line drawing of a heart with a one pulse line in the middle of it

Healthcare / Medicine

line drawing of a train trestle bridge with three columns underneath and an arched span across the top

Infrastructure

line drawing of a wrench

Manufacturing

line drawing of three people figures (one top center, one bottom left, and one bottom right) with a circle in the center between them and lines connecting each figure to the center circle

Social Networks

Visit the Camera Culture Group site to learn more.