Amanbir Singh, Developer in Delhi, India
Amanbir is available for hire
Hire Amanbir

Amanbir Singh

Verified Expert  in Engineering

Data Scientist and Back-end Developer

Location
Delhi, India
Toptal Member Since
September 13, 2021

Amanbir has 10 years of experience in data science, analytics, and back-end engineering. He has worked at a large multilateral organization and with early-stage tech startups. Amanbir excels at working with clients in tackling complex business problems and has deep expertise in machine learning, data analysis, and building scalable web apps.

Portfolio

Monsoon CreditTech
Python, Pandas, Django, Angular, Docker, Kubernetes, Machine Learning...
Harbor
OpenAI GPT-3 API, GPT, OpenAI GPT-4 API...
Grown Unknown, LLC
Python, Machine Learning, Language Models, OpenAI GPT-4 API, OpenAI GPT-3 API...

Experience

Availability

Part-time

Preferred Environment

Python, Data Analytics, Data Science, Machine Learning, Pandas, Generative Pre-trained Transformers (GPT), OpenAI GPT-3 API, Minimum Viable Product (MVP), Generative Pre-trained Transformer 3 (GPT-3), OpenAI GPT-4 API, User Interface (UI), Product Management, Large Language Models (LLMs)

The most amazing...

...data science project I've worked on is building an automated machine learning platform for credit risk assessment from the ground up.

Work Experience

Head of Product and Engineering

2016 - PRESENT
Monsoon CreditTech
  • Led the development of the SaaS AutoML platform as an architect and product manager; made wireframes, wrote user and functional requirements, decided on back-end architecture, and ran sprints using Django, Angular, Jenkins, and Docker.
  • Architected AutoML libraries used internally. The platform generated machine learning models optimized for lending.
  • Acted as a product manager and architect for developer tools used by our internal data science team to speed up model development and deployment.
  • Managed client engagements with 15 banks and NBFCs; built and deployed models to identify risky borrowers at the time of application. Increased revenue for the client by 20% and more.
  • Hired and managed a team of 10+ data scientists and software developers. Conducted one on ones, set targets for the team, and mentored junior members.
  • Built an auto-deployment process for machine learning models that supported multiple and multistage models.
Technologies: Python, Pandas, Django, Angular, Docker, Kubernetes, Machine Learning, Data Science, Machine Learning Operations (MLOps), XGBoost, Jupyter Notebook, SQL, Data Analytics, Data Visualization, Data Mining, Web Scraping, Data Reporting, Artificial Intelligence (AI), Agile, Data Analysis, Time Series, Time Series Analysis, Optimization, Financial Modeling, Amazon Web Services (AWS), MySQL, Azure, Scikit-learn, Statistics, Statistical Analysis, Real-time Data, Predictive Analytics, APIs, Banking & Finance, Architecture, Leadership, Automation Scripting, Scripting, AWS Lambda, REST APIs, Amazon S3 (AWS S3), HTML, Decision Trees, Data Scientist, Natural Language Processing (NLP), Recommendation Systems, Regression, PDF Scraping, Scraping, Back-end, Software Architecture, Azure ML Studio, Git, Amazon DynamoDB, PostgreSQL, Non-performing Loans (NPL), Data Scraping, TypeScript, NumPy, MongoDB, Serverless, Predictive Modeling, Customer Segmentation, Visualization, Django REST Framework, Full-stack Development, API Integration, AI Design, Automation, Full-stack, CSS, Flask, Solution Architecture, Software Development, PyPDF2, openpyxl, Microservices, Advisory, Technology Strategy & Architecture, Databases, Web Development, CTO, DevOps, Google Cloud Platform (GCP), JavaScript, Object-relational Mapping (ORM), Technical Leadership, Database Architecture, Agile Software Development, Data Structures, Amazon SageMaker, ETL, Minimum Viable Product (MVP), Requirements Analysis, Startups, Mathematics, Task Scheduling, Regular Expressions, Sockets, Linear Regression, Data-driven Decision-making, Decision Modeling, Neural Networks, Programming, Integration, User Interface (UI), Cloud, Models, Exploratory Data Analysis, EDA, Modeling, Data Cleaning, Unstructured Data Analysis, Large Data Sets, Data Gathering, Spreadsheets, Machine Learning Automation, Amazon Elastic Container Service (Amazon ECS), Data Processing, Product Management, Amazon EC2, Back-end Development, Azure Cosmos DB, GitHub, Azure Functions, Azure Blobs, Scrapy, Large Language Models (LLMs), Regression Modeling

AI/ML Expert / Consultant

2023 - 2023
Harbor
  • Did prompt engineering to improve LLM model predictions.
  • Compared open-source LLMs against closed models.
  • Self-hosted open-source LLMs on the company's infrastructure.
  • Built a prompt testing framework in Python to compare and improve prompts.
Technologies: OpenAI GPT-3 API, GPT, OpenAI GPT-4 API, Generative Pre-trained Transformers (GPT), Generative Pre-trained Transformer 3 (GPT-3), AIOps, Machine Learning Operations (MLOps), Natural Language Processing (NLP), Graphics Processing Unit (GPU), AI Design, Amazon SageMaker, Hugging Face, ChatGPT, Amazon EC2, Back-end Development, GitHub, LangChain, Pinecone, Large Language Models (LLMs)

AI/ML Engineer

2023 - 2023
Grown Unknown, LLC
  • Developed prompts to generate customized parental advice using OpenAI APIs.
  • Added context to the prompts to tailor the tone of the outputs.
  • Compared OpenAI with other options and created a plan for future product development.
Technologies: Python, Machine Learning, Language Models, OpenAI GPT-4 API, OpenAI GPT-3 API, GPT, Data Scientist, Language Learning, Generative Systems, Natural Language Processing (NLP), ChatGPT, Large Language Models (LLMs)

Machine Learning Expert

2023 - 2023
AmpVis Ltd.
  • Advised the client on building the MVP, including all technical steps needed.
  • Decided on team structures to handle different product decisions.
  • Consulted on hiring decisions for other technical roles.
Technologies: Python, Machine Learning, Artificial Intelligence (AI), Data Science, APIs, Google Vision API, Amazon Rekognition, Programming, Cloud, Models, Data Scientist, Generative Systems, Deep Learning, Large Language Models (LLMs)

Data Scientist

2023 - 2023
NewCloud Medical LLC
  • Built a Looker Studio dashboard to show data and summary statistics based on filters.
  • Added visualizations in the Looker Studio to generate insights from the data.
  • Created dashboard views that dynamically update based on selected fields.
Technologies: Python, PDF Scraping, Scraping, Databases, Looker, Programming, Language Models, GPT, Data Cleaning, Data Scientist, Spreadsheets, Data Processing, Large Language Models (LLMs)

Research Coordinator

2015 - 2016
JustJobs Network
  • Set up an internal data management system to track versions of datasets.
  • Led research on vocational training and skill-building programs in India. Led data collection and analysis; published a findings report.
  • Designed a training module on statistics and R, which was used for the training of new hires.
Technologies: Python, R, Data Analytics, Data Visualization, Data Mining, Web Scraping, Data Reporting, Data Analysis, Statistics, Statistical Analysis, Automation Scripting, Scripting, Data Scientist, Regression, Scraping, Git, Predictive Modeling, Visualization, Automation, Mathematics, Linear Regression, Data-driven Decision-making, Decision Modeling, Programming, Models, Exploratory Data Analysis, EDA, Modeling, Data Cleaning, Unstructured Data Analysis, Data Gathering, Spreadsheets, Data Processing, Regression Modeling

Consultant

2014 - 2015
World Bank Group
  • Supervised statewide data collection for 4,500 surveys at the individual and household levels.
  • Built models to identify factors that affected education and labor market outcomes for adolescents.
  • Participated in the dissemination of research findings.
Technologies: R, Data Science, Data Analytics, Data Visualization, Data Mining, Data Reporting, Data Analysis, Statistics, Statistical Analysis, Automation Scripting, Scripting, Regression, Git, Predictive Modeling, Visualization, ETL, Mathematics, Linear Regression, Data-driven Decision-making, Decision Modeling, Programming, Models, Exploratory Data Analysis, EDA, Modeling, Data Cleaning, Unstructured Data Analysis, Data Gathering, Spreadsheets, Data Processing, Regression Modeling

Senior Research Associate

2012 - 2014
Centre for Microfinance Research
  • Managed two randomized control trials studying the effect of financial access in India.
  • Trained and supervised a field team of 30 members for 1,700 individual surveys across four districts.
  • Designed and implemented six electronic questionnaires using Open Data Kit and SurveyCTO and built the back end for the survey data.
Technologies: STATA, Survey Design, Open Data Kit, Data Visualization, Data Mining, Data Reporting, Data Analysis, Causal Inference, Statistics, Statistical Analysis, Automation Scripting, Regression, Visualization, Mathematics, Linear Regression, Data-driven Decision-making, Models, Exploratory Data Analysis, EDA, Modeling, Data Cleaning, Unstructured Data Analysis, Data Gathering, Spreadsheets, Data Processing, Regression Modeling

AutoML Platform for Lenders

https://monsoonfintech.com/thoth/
Built an AutoML platform that takes data from lenders and produces state-of-the-art machine-learning models. Supports traditional financial data and alternate (SMS, mobile, etc.) data.

The platform produced models for new applications and to help with collections for running loans. This was offered as a SaaS product.

Custom Machine Learning Models For Lenders

https://monsoonfintech.com/
Managed a team of developers and data scientists to build models for lenders. This included models that predicted the risk of loan applications, recommendation engines for financial products, and marketing models to reach out to identify target customers.

Built and delivered models to the largest lenders in India. This led to a 30% reduction in delinquencies and increased loan approvals by 25%.

Report for the World Bank

https://documents.worldbank.org/en/publication/documents-reports/documentdetail/866381523450216235/a-window-of-opportunity-a-diagnostic-of-adolescent-girls-and-young-women-s-socio-economic-empowerment-in-jharkhand-india
Worked closely with the World Bank to identify critical challenges, along with key reforms, that adolescent girls in Jharkhand, India were facing.

My role included experimental design, data collection, analysis, and modeling. I also worked on the dissemination of the report and communication with key stakeholders.

Languages

Python, HTML, R, SQL, TypeScript, CSS, JavaScript

Frameworks

Django, Django REST Framework, Bootstrap, MUI (Material UI), Angular, Flask, Scrapy

Libraries/APIs

Pandas, XGBoost, Scikit-learn, REST APIs, NumPy, Beautiful Soup, Sockets, Google Vision API, Amazon Rekognition

Tools

Amazon SageMaker, Git, Spreadsheets, Amazon Elastic Container Service (Amazon ECS), GitHub, STATA, Open Data Kit, Azure ML Studio, Looker

Paradigms

Data Science, Automation, Object-relational Mapping (ORM), Agile, Microservices, Agile Software Development, ETL, Requirements Analysis, DevOps

Platforms

Jupyter Notebook, AWS Lambda, Amazon EC2, Docker, Amazon Web Services (AWS), Azure, Azure Functions, Kubernetes, Google Cloud Platform (GCP)

Storage

MySQL, Amazon S3 (AWS S3), PostgreSQL, MongoDB, Databases, Database Architecture, Azure Cosmos DB, Azure Blobs, Amazon DynamoDB

Other

Machine Learning, Data Analytics, Data Mining, Web Scraping, Artificial Intelligence (AI), Data Analysis, Statistics, Statistical Analysis, Predictive Analytics, APIs, Architecture, Automation Scripting, Scripting, Decision Trees, Data Scientist, Natural Language Processing (NLP), Regression, PDF Scraping, Scraping, Back-end, Software Architecture, Non-performing Loans (NPL), Data Scraping, Predictive Modeling, Customer Segmentation, Visualization, Full-stack Development, API Integration, Software Development, PyPDF2, Advisory, Technology Strategy & Architecture, Web Development, CTO, Technical Leadership, OpenAI GPT-3 API, Minimum Viable Product (MVP), Startups, Regular Expressions, Linear Regression, Data-driven Decision-making, Programming, Integration, Models, GPT, Exploratory Data Analysis, EDA, Modeling, Data Cleaning, Unstructured Data Analysis, Large Data Sets, Data Gathering, Machine Learning Automation, Data Processing, Back-end Development, Regression Modeling, Machine Learning Operations (MLOps), Data Visualization, Data Reporting, Time Series, Time Series Analysis, Real-time Data, Leadership, Recommendation Systems, Serverless, AI Design, Full-stack, Solution Architecture, Data Structures, Generative Pre-trained Transformers (GPT), Generative Pre-trained Transformer 3 (GPT-3), Mathematics, Task Scheduling, OpenAI GPT-4 API, Decision Modeling, Neural Networks, Cloud, Language Learning, Generative Systems, ChatGPT, Product Management, Large Language Models (LLMs), Survey Design, SaaS, Optimization, Financial Modeling, Causal Inference, openpyxl, User Interface (UI), Language Models, Deep Learning, AIOps, Graphics Processing Unit (GPU), Hugging Face, LangChain, Pinecone

Industry Expertise

Banking & Finance

2008 - 2012

Bachelor's Degree in Economics and Statistics

Carnegie Mellon University - Pittsburgh, PA, USA