Skip to main content
  1. Data Science Courses/

Big Data

·153 words·1 min· loading · ·
ML Courses Machine Learning ML Courses

On This Page

Table of Contents
Share with :

Big Data

Big Data Analytics
#

Big Data Systems
#

  1. What is Big Data
  2. Data Warehouse, Data Lakes
  3. Hadoop – Components
  4. Storage – HDFS, Hbase
  5. Resource Manager (MapReduce, YARN)
  6. Types of data formats (JSON, ORC, Parquet, AVRO)
  7. Scripting  (Hive, Pig)
  8. Stream Processing
  9. Massive Parallel Processing (Spark, Imapala, Mahout)
  10. RDDs in Spark
  11. Data Migration (Scoop/ Flume)
  12. Schedular (Oozie)
  13. Resource Negotiator (Zookeeper)
  14. RDBMS Database
  15. Columnar Database
  16. Multimodel Database
  17. NoSQL (HBase, Cassandra, MongoDB, DynamoDB)
  18. RDBMS (MySQL, PostgreSQL)
  19. CosmoDB
  20. In memory database (Redis)
  21. Spark SQL
  22. Case Study

Stream Processing & Analytics
#

  1. Real Time Streaming Architecture
  2. Service Configuration and Coordination
  3. Data Flow Management, Storing and Processing Streaming Data
  4. Visualization Techniques for Real Time Streaming Data
  5. Aggregation (Timed Counting, Multi Resolution Time Series Aggregation)
  6. Statistical Approximation
  7. Approximating with sketches

PySpark
#

  1. Overview & Installation.
  2. RDD
  3. Dataframe.
  4. Architecture.
  5. MLLib
  6. NLP
  7. Linear regression
  8. Logistic regression
  9. Decision tree
  10. Naive Bayes
  11. XGBoost
  12. Timeseries
  13. Spark Job automation with Scheduler
  14. NYC Parking Case Study: Apache Spark
Dr. Hari Thapliyaal's avatar

Dr. Hari Thapliyaal

Dr. Hari Thapliyal is a seasoned professional and prolific blogger with a multifaceted background that spans the realms of Data Science, Project Management, and Advait-Vedanta Philosophy. Holding a Doctorate in AI/NLP from SSBM (Geneva, Switzerland), Hari has earned Master's degrees in Computers, Business Management, Data Science, and Economics, reflecting his dedication to continuous learning and a diverse skill set. With over three decades of experience in management and leadership, Hari has proven expertise in training, consulting, and coaching within the technology sector. His extensive 16+ years in all phases of software product development are complemented by a decade-long focus on course design, training, coaching, and consulting in Project Management. In the dynamic field of Data Science, Hari stands out with more than three years of hands-on experience in software development, training course development, training, and mentoring professionals. His areas of specialization include Data Science, AI, Computer Vision, NLP, complex machine learning algorithms, statistical modeling, pattern identification, and extraction of valuable insights. Hari's professional journey showcases his diverse experience in planning and executing multiple types of projects. He excels in driving stakeholders to identify and resolve business problems, consistently delivering excellent results. Beyond the professional sphere, Hari finds solace in long meditation, often seeking secluded places or immersing himself in the embrace of nature.

Comments:

Share with :

Related

AI for Prospective Email Writing
·491 words·3 mins· loading
ML Courses TensorFlow Lite Android Development
AI for Prospective Email Writing # Course Objective # Equip participants with the skills to draft …
GenAI for Cybersecurity
·526 words·3 mins· loading
ML Courses TensorFlow Lite Android Development
GenAI for Cybersecurity # Course Overview: Here’s a simplified and enriched version of your course …
Train Tensorflow Lite Models for Android
·852 words·4 mins· loading
ML Courses TensorFlow Lite Android Development
Course Title: Developing Solutions with Agentic AI # Course Outline # Module 1: Introduction to …
AI Powered Account Management Strategies
·421 words·2 mins· loading
ML Courses Artificial Intelligence Account Management
Program Outline: AI Powered Account Management Strategies # Duration: # 2 Days Course Audience: # …
Generative AI for Client and Stakeholder Engagement
·412 words·2 mins· loading
ML Courses Generative AI Stakeholder Engagement
Program Outline: AI Powered Client and Stakeholder Engagement # Duration: # 2 Days Course Audience: …