Rakshith Kumar Karkala

Certified

Data Scientist

Tanisi Inc | Toronto | Bengaluru

My Portfolio Highlights

My New Certification

Data Engineer

My New Certification

Data Scientist Associate

My New Certification

AI Fundamentals

Mining data for gold, building models that drive measurable business outcomes by bridging the gap between machine learning and business strategy.

My Work

Take a look at my latest work.

DataLab

Detecting Tuberculosis in X-Rays using Deep Learning

notebook

Detecting Tuberculosis in X-Rays by Custom Models

Python | Theory

My Certifications

These are the industry credentials that I’ve earned.

Data Engineer

AI Engineer for Data Scientists Associate

SQL Associate

Python Data Associate

Data Scientist Associate

AI Fundamentals

Data Literacy

Other Certificates

Databricks Academy Accreditation - Databricks Fundamentals

Databricks Academy Accreditation - Generative AI Fundamentals

Microsoft Data Visualization Fundamentals

DataCamp Course Completion

Take a look at all the courses I’ve completed on DataCamp.

My Work Experience

Where I've interned and worked during my career.

Tanisi Foods and Beverages | Aug 2022 - Jun 2025

Data Engineer | Data Scientist | Digital Analytics & Performance

Led data science and engineering initiatives processing 2.5M+ records monthly, combining predictive modeling with scalable ETL/ELT pipeline architecture and working directly with C-suite executives to turn business challenges into actionable solutions. Built machine learning algorithms, recommendation systems, and statistical forecasting models that improved demand forecasting accuracy by 30%. Architected enterprise-grade data infrastructure using cloud data factory patterns, with data quality checks and validation across development, staging, and production environments. Developed NLP solutions using transformers, NLTK, spaCy, and LangChain, enhancing customer insights with 35% improved response accuracy for dashboards serving 10,000+ daily users. Implemented real-time streaming architectures using Apache Kafka with KSQL for stream processing, along with cloud messaging services, to ensure data integrity and performance. Deployed production-ready ML models using MLflow, Docker, and Kubernetes with automated CI/CD pipelines, achieving 99.5% uptime. Designed data warehouse solutions using 3rd Normal Form and Kimball/Inmon methodologies, integrating Hadoop ecosystem tools with enterprise database patterns for seamless data migration. Created data narratives and visualizations in Power BI and Tableau that drove strategic decision-making, resulting in $50K+ annual cost savings through analytical optimization. Implemented data quality frameworks, monitoring systems, data lineage tracking, and metadata management using Azure Data Factory, Azure Data Lake Gen2, Azure Synapse Analytics, GCP BigQuery, Dataflow, Pub/Sub, AWS S3, EMR, and Glue, applying data architecture patterns including Lakehouse, Medallion, and Delta Architecture.

Damodar IT Solutions Pvt. Ltd. | Mar 2020 - Apr 2021

Data Analyst Associate | Full Stack Dev | Data Scientist & Engineering

Delivered integrated data science and engineering solutions processing 500K+ daily records by combining predictive analytics development with automated data pipelines to drive decisions across marketing, sales, and operations. Designed and deployed predictive models using Python, scikit-learn, and advanced statistical methods, while automating ETL workflows with Pandas, NumPy, SQL optimization, and PowerShell for production-grade processing with rigorous validation and quality checks. Built analytical workflows that reduced processing time by 50% through statistical optimization and performance profiling. Managed enterprise databases (PostgreSQL, SQL Server, MySQL, MongoDB) with zero downtime, implementing 3rd Normal Form and dimensional modeling (Kimball methodology) to optimize warehouse design and query performance. Created robust preprocessing pipelines for structured and unstructured data, ensuring statistical rigor and data quality. Developed interactive BI dashboards in Power BI and Tableau, streamlining reporting and enabling informed decision-making across departments. Presented insights through compelling visualizations and data storytelling to business stakeholders. Engineered and optimized RESTful APIs and GraphQL endpoints using C# design patterns and Node.js frameworks for seamless integration with third-party systems and external data. Established data quality frameworks, monitoring systems, and automated alerting to safeguard data integrity and system reliability, embedding observability and continuous optimization across all database systems.

Curiouz TechLabs | Sep 2019 - Mar 2020

Full Stack Developer | Data Scientist & Engineering

Pioneered AI-driven healthcare solutions by integrating advanced data science with robust engineering infrastructure, specializing in medical imaging analysis and processing 50K+ CT scans through enterprise-grade data systems and deep learning for clinical decision support. Designed and deployed 3D CNN models in TensorFlow for bladder cancer staging classification, while building scalable data infrastructure to collect, clean, validate, and process structured and unstructured imaging data using Python, advanced SQL, and healthcare-compliant tools. Engineered custom CNN architectures for feature extraction and transformation of grayscale CT images, significantly improving tumor detection accuracy. Developed Hadoop-based big data pipelines with Hive to support distributed medical imaging analytics, ensuring data integrity, performance optimization, and fault tolerance under strict compliance standards. Built end-to-end machine learning pipelines with advanced statistical validation, evaluation frameworks, and deployment capabilities tailored for healthcare applications. Collaborated with medical researchers, clinical analysts, and professionals at Kasturba Medical College to translate complex clinical requirements into scalable technical solutions for enterprise data management. Established rigorous testing protocols, validation frameworks, and continuous monitoring systems to guarantee data integrity, security, and patient privacy across all workflows. Delivered healthcare-grade solutions that balanced regulatory compliance with innovation, enabling reliable medical data pipelines, optimized imaging analytics, and impactful clinical decision support.

My Education

Take a look at my formal education.

Post Graduate Degree in Data Analysis in Business Decision Making | Durham College | 2022
Bachelor's Degree in Computer Science | Alva's College of Education | 2020

About Me

Rakshith Kumar Karkala

Data Scientist, Engineer & Analyst with 4+ years building ETL pipelines and deploying ML models on AWS/GCP/Azure. Specialized in Python, SQL, Spark, and cloud architectures that drive measurable business impact.
