Tiny-Pytorch is a deep learning system that is similar in nature to Pytorch. It involves implementing the core machinery and algorithms that underlie deep learning systems: 1) automatic differentiation, 2) a Tensor (multi-dimensional array), 3) neural network modules such as Linear/BatchNorm/RNN/LSTM, 4) optimization algorithms such as Stochastic Gradient Descent (SGD) and Adaptive Moment Estimation (Adam), and 5) hardware acceleration (e.g. GPUs).
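To give a feel for where the project is headed, here is a hypothetical training-loop sketch in the spirit of a Pytorch-like API. The package layout, module paths, and method names (`tiny_pytorch`, `nn.Linear`, `optim.SGD`, `reset_grad`, etc.) are assumptions for illustration only and may not match the actual code.

```python
# Hypothetical usage sketch -- names and paths are assumptions, not the actual API.
import numpy as np

import tiny_pytorch as tp           # assumed package name
from tiny_pytorch import nn, optim  # assumed module layout

# A toy regression model built from basic nn building blocks.
model = nn.Linear(in_features=10, out_features=1)
opt = optim.SGD(model.parameters(), lr=0.01)

x = tp.Tensor(np.random.randn(32, 10).astype(np.float32))
y = tp.Tensor(np.random.randn(32, 1).astype(np.float32))

for step in range(100):
    pred = model(x)
    loss = ((pred - y) ** 2).sum()  # simple squared-error loss
    opt.reset_grad()                # name is a guess; could be zero_grad()
    loss.backward()                 # reverse-mode AD populates gradients
    opt.step()                      # update parameters in place
```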
The most critical part of this project, and where most of the learning happens, is building everything from the ground up:
```mermaid
graph BT
    A[Flat Array] --> B[N-Dimensional Array]
    B[N-Dimensional Array] --> C[Tensor]
```
I am building Tiny-Pytorch from scratch for a few reasons:

- Build deep learning systems:
  - Contribute to open-source deep learning frameworks.
  - Work on developing my own framework for specific tasks. I have been collecting my own implementations of different things in Pytorch, such as analyzing the gradients of each layer.
- Use existing systems more effectively:
  - Understanding how the internals of existing deep learning systems work lets you use them much more efficiently.
  - The only way to understand how things really work is to build them from scratch.
- Understand how operations are carried out on both CPU and GPU so I can optimize my customized models/layers to run more efficiently.
To keep things simple, we will follow a top-down approach. In other words, we will first build `Tensor` and all its machinery such as operations (`Op`s), automatic differentiation, etc. During this phase, we will be using `numpy` as our backend. Once we're done with the basic building blocks of our `Tensor`, we will move on to build `NDArray` and the different backends that can be used to do the computation.
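To make the Phase I plan concrete, below is a minimal, self-contained sketch of what a `Tensor`/`Op` pair with reverse-mode automatic differentiation could look like on top of `numpy`. It is illustrative only and deliberately simplified; the actual Tiny-Pytorch classes, method names, and graph bookkeeping will differ.

```python
# Illustrative sketch only -- not the actual Tiny-Pytorch implementation.
import numpy as np


class Op:
    """An operation computes its output (forward) and the gradients of its
    inputs given the gradient of its output (backward)."""

    def forward(self, *arrays):
        raise NotImplementedError

    def backward(self, out_grad, node):
        raise NotImplementedError


class AddOp(Op):
    def forward(self, a, b):
        return a + b

    def backward(self, out_grad, node):
        # d(a + b)/da = 1 and d(a + b)/db = 1
        return out_grad, out_grad


class MulOp(Op):
    def forward(self, a, b):
        return a * b

    def backward(self, out_grad, node):
        a, b = node.inputs
        return out_grad * b.data, out_grad * a.data


class Tensor:
    """Records the op that created it and its input tensors, which is exactly
    the information reverse-mode AD needs to traverse the graph backwards."""

    def __init__(self, data, op=None, inputs=()):
        self.data = np.asarray(data, dtype=np.float32)
        self.op = op          # None for leaf tensors
        self.inputs = inputs  # empty for leaf tensors
        self.grad = None

    @staticmethod
    def from_op(op, inputs):
        return Tensor(op.forward(*[t.data for t in inputs]), op, inputs)

    def __add__(self, other):
        return Tensor.from_op(AddOp(), (self, other))

    def __mul__(self, other):
        return Tensor.from_op(MulOp(), (self, other))

    def backward(self):
        # Walk the graph from the output back to the leaves, accumulating
        # gradients (a real implementation would use a reverse topological
        # order instead of naive recursion).
        self.grad = np.ones_like(self.data)

        def visit(node):
            if node.op is None:
                return
            grads = node.op.backward(node.grad, node)
            for inp, g in zip(node.inputs, grads):
                inp.grad = g if inp.grad is None else inp.grad + g
                visit(inp)

        visit(self)


x = Tensor([2.0, 3.0])
y = Tensor([4.0, 5.0])
z = x * y + x   # dz/dx = y + 1, dz/dy = x
z.backward()
print(x.grad)   # [5. 6.]
print(y.grad)   # [2. 3.]
```

Because gradients flow from the output backwards, a single backward pass produces the gradients with respect to every input, which is why reverse-mode AD is the better fit for training neural networks with many parameters and a scalar loss.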
- Phase I:
  - `Tensor`: A multi-dimensional array whose elements are all of the same type. It is the main component of our automatic differentiation because it records the operation that created it, the input data used in the operation, the output data, etc. If it is a leaf or detached `Tensor`, all of these will be `None`.
  - `Op`: Operations on `Tensor`s. Each operation should implement forward and backward passes and return a new `Tensor`.
  - Automatic Differentiation: The method we will use to build the automatic differentiation framework is Reverse Mode Automatic Differentiation (AD), which is much more efficient than the alternative, Forward Mode AD.
  - `init`: Functions to initialize neural network parameters.
  - `nn`: Basic building blocks of the neural network graph such as `Linear`, `Conv2d`, `BatchNorm`, etc.
  - `optimizer`: Implementation of various optimization algorithms such as `SGD` and `Adam`.
  - `data`: Classes to load various types of data; mainly `Dataset` and `DataLoader`.
- Phase II:
  - `NDArray`: A generic class that supports multiple backends and provides us with a strided array. All the underlying arrays are flat arrays stored in row-major order, but `NDArray` lets us represent any multi-dimensional array using an offset, strides, and a shape (see the sketch after this list).
  - Numpy backend (default backend)
  - CPU backend
  - Cuda backend
  - CNN and its main operations such as padding and dilation
  - Resnet
  - RNN
  - LSTM
  - LLM
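The sketch below illustrates the strided-array idea behind `NDArray`: a flat, row-major buffer plus a shape, strides, and an offset are enough to describe a multi-dimensional view without copying data. It is a simplified illustration in plain Python/NumPy, not the actual Tiny-Pytorch `NDArray` class.

```python
# Illustrative sketch of a strided view over a flat buffer -- not the real NDArray.
import numpy as np

flat = np.arange(12, dtype=np.float32)   # flat, row-major storage: 0..11

# A (3, 4) view: moving one row skips 4 elements, moving one column skips 1.
shape, strides, offset = (3, 4), (4, 1), 0

def get(idxs):
    """Map a multi-dimensional index to a position in the flat buffer."""
    return flat[offset + sum(i * s for i, s in zip(idxs, strides))]

print(get((1, 2)))           # row 1, col 2 -> flat[1*4 + 2] = 6.0

# A transpose is just a swap of shape and strides -- no data is moved.
t_shape, t_strides = (4, 3), (1, 4)
print(flat[0 * 1 + 2 * 4])   # transposed element (0, 2) -> 8.0
```

Operations like reshape, transpose, and broadcasting can then be implemented by manipulating only the shape, strides, and offset, which is also why a `compact()` call is needed before kernels that assume a contiguous buffer (see the limitations below).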
The official documentation is hosted on https://imaddabbura.github.io/tiny-pytorch/.
- Broadcasting has to be done explicitly for all element-wise operations if the two ndarrays/tensors are not of the same shape. For example, if we have two tensors `x` and `y` with shapes `(10,)` and `(20, 10)` respectively, we can add them together as follows: `x.reshape((1, 10)).broadcast_to((20, 10)) + y`.
- `NDArray` only supports the `float32` dtype.
- All operations on the underlying 1D flat array are done on compact arrays. Therefore, we need to call `compact()` before any operation to create a contiguous array if it is not already compact (see the NumPy illustration after this list).
- Reduction sum/max can either be done on one axis or on all axes; summing/taking the max across a subset of axes isn't supported.
Tiny-Pytorch is released under the Apache License, as found in the LICENSE file.