A Manual Implementation of Quantization in PyTorch - Single Layer
Part II :  Shrinking Neural Networks for Embedded Systems Using Low Rank Approximations (LoRA)