Telesens

Telesens

  • Home
  • About Me
  • Contact
  • Stories
  • Apps

Distributed data parallel training using Pytorch on AWS

April 4, 2019 ankur6ue 5

In this post, I’ll describe how to use distributed data parallel techniques on multiple AWS GPU servers to speed up Machine Learning (ML) training. Along […]

Pipelining data processing and host-to-device data transfer

February 16, 2019 ankur6ue 0

The purpose of this post is to show how to use multi-threading to parallelize data processing with data transfer from pageable to page-locked memory. I […]

Neural Network Loss Visualization

January 16, 2019 ankur6ue 0

Plotting its shape helps in understanding the properties and behaviour of a function. Unfortunately since we live in a 3D world, we can’t visualize functions […]

Wrapping a Python Application into a Web Service using mod_wsgi and gunicorn

October 1, 2018 ankur6ue 0

So you developed a cool AI algorithm and want to show it off through a web service? You know a lot about AI algorithms and […]

Understanding Matrix Multiplication on a Weight-Stationary Systolic Architecture

July 30, 2018 ankur6ue 5

If you follow the hardware for deep learning space, you may have heard of the term “systolic array”. A 2D systolic array forms the heart […]

Understanding Roofline Charts

July 26, 2018 ankur6ue 4

The purpose of this post is to discuss my current understanding of roofline charts. Let me lay some background first. Before I got into machine […]

Data Augmentation in SSD (Single Shot Detector)

June 28, 2018 ankur6ue 2

Over the past few days, I have been investigating how SSD (Single Shot Detector), an object detector introduced in the following paper in Dec 2016 […]

Efficiently Computing the Fisher Vector Product in TRPO

June 9, 2018 ankur6ue 8

The purpose of this post is to provide math proofs and clarify some implementation details in the recently introduced reinforcement learning method called “Trust Region […]

Initializing Weights for the Convolutional and Fully Connected Layers

April 9, 2018 ankur6ue 0

You may have noticed that weights for convolutional and fully connected layers in a deep neural network (DNN) are initialized in a specific way. For […]

Object Detection and Classification using R-CNNs

March 11, 2018 ankur6ue 34

In this post, I’ll describe in detail how R-CNN (Regions with CNN features), a recently introduced deep learning based object detection and classification method works. R-CNN’s […]

Posts navigation

« 1 2 3 … 5 »

Recent Posts

  • Deploying Ray on a local kubernetes cluster
  • Speeding up Shapley value computation using Ray, a distributed computing system
  • Kernel SHAP
  • Building a Information Retrieval system based on the Covid-19 research challenge dataset: Part 3
  • Building a Information Retrieval system based on the Covid-19 research challenge dataset: Part 2

Categories

Other Machine Learning Posts
  • No Image

    Deploying Ray on a local kubernetes cluster

    November 8, 2020 0
  • No Image

    Kernel SHAP

    September 17, 2020 0
  • Building a Information Retrieval system based on the Covid-19 research challenge dataset: Part 3

    June 16, 2020 0
  • Building a Information Retrieval system based on the Covid-19 research challenge dataset: Part 2

    June 10, 2020 2
  • Building a Information Retrieval system based on the Covid-19 research challenge dataset: Part 1

    June 10, 2020 1
Other Sensor Fusion Related Posts
  • Sensor Fusion: Part 4

    May 7, 2017 0
  • Sensor Fusion – Part 3: Implementation of Gyro-Accel Sensor Fusion

    May 2, 2017 6
  • Sensor Fusion: Part 2 (combining Gyro-Accel data)

    April 30, 2017 0
  • Sensor Fusion: Part 1

    April 27, 2017 0

Copyright © 2021 | WordPress Theme by MH Themes