Instructor Led Live Online
Self Learning + Live Mentoring
Customize Your Training
The entire training includes real-world projects and highly valuable case studies.
IABAC® certification provides global recognition of the relevant skills, thereby opening opportunities across the world.
MODULE 1: DATA SCIENCE ESSENTIALS
• Introduction to Data Science
• Evolution of Data Science
• Big Data Vs Data Science
• Data Science Terminologies
• Data Science vs AI/Machine Learning
• Data Science vs Analytics
MODULE 2: DATA SCIENCE DEMO
• Business Requirement: Use Case
• Data Preparation
• Machine learning Model building
• Prediction with ML model
• Delivering Business Value.
MODULE 3: ANALYTICS CLASSIFICATION
• Types of Analytics
• Descriptive Analytics
• Diagnostic Analytics
• Predictive Analytics
• Prescriptive Analytics
• EDA and insight gathering demo in Tableau
MODULE 4: DATA SCIENCE AND RELATED FIELDS
• Introduction to AI
• Introduction to Computer Vision
• Introduction to Natural Language Processing
• Introduction to Reinforcement Learning
• Introduction to GAN
• Introduction to Generative Passive Models
MODULE 5: DATA SCIENCE ROLES & WORKFLOW
• Data Science Project workflow
• Roles: Data Engineer, Data Scientist, ML Engineer and MLOps Engineer
• Data Science Project stages.
MODULE 6: MACHINE LEARNING INTRODUCTION
• What Is ML? ML Vs AI
• ML Workflow, Popular ML Algorithms
• Clustering, Classification And Regression
• Supervised Vs Unsupervised
MODULE 7: DATA SCIENCE INDUSTRY APPLICATIONS
• Data Science in Finance and Banking
• Data Science in Retail
• Data Science in Health Care
• Data Science in Logistics and Supply Chain
• Data Science in Technology Industry
• Data Science in Manufacturing
• Data Science in Agriculture
MODULE 1: PYTHON BASICS
• Introduction to Python
• Installation of Python and IDE
• Python Variables
• Python basic data types
• Numbers, Booleans and Strings
• Arithmetic Operators
• Comparison Operators
• Assignment Operators
MODULE 2: PYTHON CONTROL STATEMENTS
• IF Conditional statement
• IF-ELSE
• NESTED IF
• Python Loops basics
• WHILE Statement
• FOR statements
• BREAK and CONTINUE statements
MODULE 3: PYTHON DATA STRUCTURES
• Basic data structures in Python
• Basics of List
• List: Object, methods
• Tuple: Object, methods
• Sets: Object, methods
• Dictionary: Object, methods
MODULE 4: PYTHON FUNCTIONS
• Functions basics
• Function Parameter passing
• Lambda functions
• Map, reduce, filter functions
MODULE 1: OVERVIEW OF STATISTICS
• Introduction to Statistics
• Descriptive And Inferential Statistics
• Basic Terms Of Statistics
• Types Of Data
MODULE 2: HARNESSING DATA
• Random Sampling
• Sampling With Replacement And Without Replacement
• Cochran's Minimum Sample Size
• Types of Sampling
• Simple Random Sampling
• Stratified Random Sampling
• Cluster Random Sampling
• Systematic Random Sampling
• Multi stage Sampling
• Sampling Error
• Methods Of Collecting Data
MODULE 3: EXPLORATORY DATA ANALYSIS
• Exploratory Data Analysis Introduction
• Measures Of Central Tendency: Mean, Median And Mode
• Measures Of Dispersion: Range, Variance And Standard Deviation
• Data Distribution Plot: Histogram
• Normal Distribution & Properties
• Z Value / Standard Value
• Empirical Rule and Outliers
• Central Limit Theorem
• Normality Testing
• Skewness & Kurtosis
• Measures Of Distance: Euclidean, Manhattan And Minkowski Distance
• Covariance & Correlation
MODULE 4: HYPOTHESIS TESTING
• Hypothesis Testing Introduction
• P-Value, Critical Region
• Types of Hypothesis Testing
• Hypothesis Testing Errors: Type I And Type II
• Two Sample Independent T-test
• Two Sample Related (Paired) T-test
• One Way Anova Test
• Application of Hypothesis testing
MODULE 1: MACHINE LEARNING INTRODUCTION
• What Is ML? ML Vs AI
• Clustering, Classification And Regression
• Supervised Vs Unsupervised
MODULE 2: PYTHON NUMPY PACKAGE
• Introduction to Numpy Package
• Array as Data Structure
• Core Numpy functions
• Matrix Operations, Broadcasting in Arrays
MODULE 3: PYTHON PANDAS PACKAGE
• Introduction to Pandas package
• Series in Pandas
• Data Frame in Pandas
• File Reading in Pandas
• Data munging with Pandas
MODULE 4: VISUALIZATION WITH PYTHON - MATPLOTLIB
• Visualization Packages (Matplotlib)
• Components Of A Plot, Sub-Plots
• Basic Plots: Line, Bar, Pie, Scatter
MODULE 5: PYTHON VISUALIZATION PACKAGE - SEABORN
• Seaborn: Basic Plot
• Advanced Python Data Visualizations
MODULE 6: ML ALGO: LINEAR REGRESSION
• Introduction to Linear Regression
• How it works: Regression and Best Fit Line
• Modeling and Evaluation in Python
MODULE 7: ML ALGO: LOGISTIC REGRESSION
• Introduction to Logistic Regression
• How it works: Classification & Sigmoid Curve
• Modeling and Evaluation in Python
MODULE 8: ML ALGO: K MEANS CLUSTERING
• Understanding Clustering (Unsupervised)
• K Means Algorithm
• How it works: K Means theory
• Modeling in Python
MODULE 9: ML ALGO: KNN
• Introduction to KNN
• How It Works: Nearest Neighbor Concept
• Modeling and Evaluation in Python
MODULE 1: FEATURE ENGINEERING
• Introduction to Feature Engineering
• Feature Engineering Techniques: Encoding, Scaling, Data Transformation
• Handling Missing values, handling outliers
• Creation of Pipeline
• Use case for feature engineering
MODULE 2: ML ALGO: SUPPORT VECTOR MACHINE (SVM)
• Introduction to SVM
• How It Works: SVM Concept, Kernel Trick
• Modeling and Evaluation of SVM in Python
MODULE 3: PRINCIPAL COMPONENT ANALYSIS (PCA)
• Building Blocks Of PCA
• How it works: Finding Principal Components
• Modeling PCA in Python
MODULE 4: ML ALGO: DECISION TREE
• Introduction to Decision Tree & Random Forest
• How it works
• Modeling and Evaluation in Python
MODULE 5: ENSEMBLE TECHNIQUES - BAGGING
• Introduction to Ensemble technique
• Bagging and How it works
• Modeling and Evaluation in Python
MODULE 6: ML ALGO: NAÏVE BAYES
• Introduction to Naive Bayes
• How it works: Bayes' Theorem
• Naive Bayes For Text Classification
• Modeling and Evaluation in Python
MODULE 7: GRADIENT BOOSTING, XGBOOST
• Introduction to Boosting and XGBoost
• How it works?
• Modeling and Evaluation in Python
MODULE 1: TIME SERIES FORECASTING - ARIMA
• What is Time Series?
• Trend, Seasonality, cyclical and random
• Stationarity of Time Series
• Autoregressive Model (AR)
• Moving Average Model (MA)
• ARIMA Model
• Autocorrelation and AIC
• Time Series Analysis in Python
MODULE 2: SENTIMENT ANALYSIS
• Introduction to Sentiment Analysis
• NLTK Package
• Case study: Sentiment Analysis on Movie Reviews
MODULE 3: REGULAR EXPRESSIONS WITH PYTHON
• Regex Introduction
• Regex codes
• Text extraction with Python Regex
MODULE 4: ML MODEL DEPLOYMENT WITH FLASK
• Introduction to Flask
• URL and App routing
• Flask application – ML Model deployment
MODULE 5: ADVANCED DATA ANALYSIS WITH MS EXCEL
• MS Excel core Functions
• Advanced Functions (VLOOKUP, INDIRECT, etc.)
• Linear Regression with EXCEL
• Data Table
• Goal Seek Analysis
• Pivot Table
• Solving Data Equation with EXCEL
MODULE 6: AWS CLOUD FOR DATA SCIENCE
• Introduction to Cloud
• Difference between GCP, Azure, AWS
• AWS Service (EC2 instance)
MODULE 7: AZURE FOR DATA SCIENCE
• Introduction to Azure ML Studio
• Data Pipeline
• ML modeling with Azure
MODULE 8: INTRODUCTION TO DEEP LEARNING
• Introduction to Artificial Neural Network, Architecture
• Artificial Neural Network in Python
• Introduction to Convolutional Neural Network, Architecture
• Convolutional Neural Network in Python
MODULE 1: DATABASE INTRODUCTION
• DATABASE Overview
• Key concepts of database management
• Relational Database Management System
• CRUD operations
MODULE 2: SQL BASICS
• Introduction to Databases
• Introduction to SQL
• SQL Commands
• MySQL Workbench installation
MODULE 3: DATA TYPES AND CONSTRAINTS
• Numeric, Character, date time data type
• Primary key, Foreign key, Not null
• Unique, Check, default, Auto increment
MODULE 4: DATABASES AND TABLES (MySQL)
• Create database
• Delete database
• Show and use databases
• Create table, Rename table
• Delete table, Delete table records
• Create new table from existing data types
• Insert into, Update records
• Alter table
MODULE 5: SQL JOINS
• Inner Join, Outer Join
• Left Join, Right Join
• Self Join, Cross join
• Window functions: Over, Partition By, Rank
MODULE 6: SQL COMMANDS AND CLAUSES
• Select, Select distinct
• Aliases, Where clause
• Relational operators, Logical
• Between, Order by, In
• Like, Limit, null/not null, group by
• Having, Sub queries
MODULE 7: DOCUMENT DB/NO-SQL DB
• Introduction of Document DB
• Document DB vs SQL DB
• Popular Document DBs
• MongoDB basics
• Data format and Key methods
MODULE 1: GIT INTRODUCTION
• Purpose of Version Control
• Popular Version control tools
• Git: Distributed Version Control
• Terminologies
• Git Workflow
• Git Architecture
MODULE 2: GIT REPOSITORY and GitHub
• Git Repo Introduction
• Create New Repo with Init command
• Git Essentials: Copy & User Setup
• Mastering Git and GitHub
MODULE 3: COMMITS, PULL, FETCH AND PUSH
• Code Commits
• Pull, Fetch and Conflicts resolution
• Pushing to Remote Repo
MODULE 4: TAGGING, BRANCHING AND MERGING
• Organize code with branches
• Checkout branch
• Merge branches
• Editing Commits
• Commit command Amend flag
• Git reset and revert
MODULE 5: GIT WITH GITHUB AND BITBUCKET
• Creating GitHub Account
• Local and Remote Repo
• Collaborating with other developers
MODULE 1: BIG DATA INTRODUCTION
• Big Data Overview
• Five Vs of Big Data
• What is Big Data and Hadoop
• Introduction to Hadoop
• Components of Hadoop Ecosystem
• Big Data Analytics Introduction
MODULE 2: HDFS AND MAP REDUCE
• HDFS – Big Data Storage
• Distributed Processing with Map Reduce
• Mapping and reducing stages concepts
• Key Terms: Output Format, Partitioners, Combiners, Shuffle, and Sort
MODULE 3: PYSPARK FOUNDATION
• PySpark Introduction
• Spark Configuration
• Resilient distributed datasets (RDD)
• Working with RDDs in PySpark
• Aggregating Data with Pair RDDs
MODULE 4: SPARK SQL and HADOOP HIVE
• Introducing Spark SQL
• Spark SQL vs Hadoop Hive
MODULE 1: TABLEAU FUNDAMENTALS
• Introduction to Business Intelligence & Introduction to Tableau
• Interface Tour, Data visualization: Pie chart, Column chart, Bar chart
• Tree Map, Line Chart
• Area chart, Combination Charts, Map
• Dashboards creation, Quick Filters
• Create Table Calculations
• Create Calculated Fields
• Create Custom Hierarchies
MODULE 2: POWER-BI BASICS
• Power BI Introduction
• Basic Visualizations
• Dashboard Creation
• Basic Data Cleaning
• Basic DAX Functions
MODULE 3: DATA TRANSFORMATION TECHNIQUES
• Exploring Query Editor
• Data Cleansing and Manipulation:
• Creating Our Initial Project File
• Connecting to Our Data Source
• Editing Rows
• Changing Data Types
• Replacing Values
MODULE 4: CONNECTING TO VARIOUS DATA SOURCES
• Connecting to a CSV File
• Connecting to a Webpage
• Extracting Characters
• Splitting and Merging Columns
• Creating Conditional Columns
• Creating Columns from Examples
• Create Data Model
Data Science encompasses the comprehensive field dedicated to deriving valuable insights and knowledge from extensive sets of both structured and unstructured data. It relies on various methods, algorithms, and systems to analyze, interpret, and present information effectively.
The Data Science process involves the systematic collection, cleansing, and analysis of data to uncover meaningful patterns and trends. Statistical models, machine learning algorithms, and data visualization techniques are commonly employed to make well-informed decisions based on the gathered information.
Data Science finds practical applications in diverse areas, including predictive analytics, fraud detection, recommendation systems, sentiment analysis, and the optimization of business processes across various industries.
Critical components of a Data Science pipeline include data collection, data cleaning, exploratory data analysis (EDA), feature engineering, model training, model evaluation, and deployment, all working together to extract valuable insights from data.
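As a rough illustration of how these stages fit together, the following minimal sketch runs a tiny pipeline using only the Python standard library. The dataset (hours studied versus exam score), the variable names, and the choice of a simple least-squares line as the model are all assumptions made for this example; data collection, feature engineering, and deployment are left out to keep it short.

```python
from statistics import mean

# Hypothetical raw records: (hours_studied, exam_score); None marks a missing value.
raw = [(1, 52), (2, 55), (None, 60), (4, 65), (5, 70), (6, None), (7, 78), (8, 83)]

# 1. Data cleaning: drop records with a missing field.
clean = [(x, y) for x, y in raw if x is not None and y is not None]

# 2. Exploratory data analysis: a quick look at the averages.
avg_hours = mean(x for x, _ in clean)
avg_score = mean(y for _, y in clean)

# 3. Hold out the last two rows for evaluation; train on the rest.
train, test = clean[:-2], clean[-2:]

# 4. Model training: ordinary least squares for score = slope * hours + intercept.
def fit_line(points):
    mx = mean(p[0] for p in points)
    my = mean(p[1] for p in points)
    sxy = sum((x - mx) * (y - my) for x, y in points)
    sxx = sum((x - mx) ** 2 for x, _ in points)
    slope = sxy / sxx
    return slope, my - slope * mx

slope, intercept = fit_line(train)

# 5. Model evaluation: mean absolute error on the held-out rows.
def predict(hours):
    return slope * hours + intercept

mae = mean(abs(predict(x) - y) for x, y in test)

print(f"rows kept: {len(clean)}, slope: {slope:.2f}, MAE: {mae:.2f}")
```

In practice each stage is usually handled by dedicated libraries such as Pandas for cleaning and scikit-learn for modeling, but the order of the steps stays the same.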
Data Science commonly relies on programming languages like Python and R, known for their extensive libraries and frameworks facilitating tasks such as data manipulation, analysis, and machine learning.
Machine learning plays a pivotal role in Data Science by empowering systems to learn patterns from data and make predictions or decisions without explicit programming. This capability enhances the extraction of valuable insights from complex datasets.
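To make "learning patterns without explicit programming" more concrete, here is a small sketch of a k-nearest-neighbours classifier written with the Python standard library. The labelled examples, the `knn_predict` helper, and the "small"/"large" labels are hypothetical and exist only for illustration. No rule such as "above 170 cm means large" is ever written down; the prediction comes entirely from the labelled data.

```python
from collections import Counter
from math import dist

# Hypothetical labelled examples: (height_cm, weight_kg) -> label.
training_data = [
    ((150, 45), "small"),
    ((155, 50), "small"),
    ((160, 55), "small"),
    ((175, 80), "large"),
    ((180, 85), "large"),
    ((185, 90), "large"),
]

def knn_predict(point, examples, k=3):
    """Label `point` by majority vote among its k nearest labelled examples."""
    # Sort the examples by Euclidean distance to the query point and keep the k closest.
    nearest = sorted(examples, key=lambda ex: dist(point, ex[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

prediction_a = knn_predict((158, 52), training_data)
prediction_b = knn_predict((178, 82), training_data)
print(prediction_a, prediction_b)
```

Adding or changing the labelled examples changes the predictions without touching the code, which is the essential difference from a hand-written rule.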
The concept of Big Data is closely intertwined with Data Science, as it involves managing and analyzing vast datasets that may challenge conventional tools. Data Science methodologies and algorithms are frequently applied to overcome the challenges posed by Big Data, extracting meaningful insights from these massive datasets.
The adaptability of Data Science is apparent across various sectors such as healthcare, finance, marketing, and manufacturing. Its applications span from streamlining operational processes to enhancing decision-making and overall business efficacy.
While Data Science encompasses a broader range of tasks, including data cleaning, exploration, and visualization, machine learning specifically focuses on developing algorithms that empower systems to learn patterns and make predictions.
Certification courses in Data Science are accessible to individuals from diverse backgrounds, including IT professionals, statisticians, analysts, and business experts. A foundational understanding of statistics and programming proves advantageous for those embarking on the journey of mastering Data Science.
In 2024, the data science job market in Phnom Penh is thriving, experiencing a notable increase in demand for skilled professionals.
Participating in data science internships proves advantageous in Phnom Penh, offering practical experiences that enhance one's employability within the field.
According to a Glassdoor report, the salary for data scientists in Phnom Penh varies, with figures starting from KHR 6,700,000 per year.
Certainly, individuals without prior experience can enroll in data science courses and secure employment in Phnom Penh, as companies are increasingly willing to hire skilled beginners.
Enrolling in data science training courses in Phnom Penh does not necessitate a postgraduate degree; many programs accept candidates with relevant undergraduate backgrounds.
Businesses in Phnom Penh leverage data science for growth by improving decision-making processes, optimizing operations, and enhancing overall customer experiences.
In the finance sector, data science is applied to areas such as risk management, fraud detection, and predictive analytics.
Data science contributes to e-commerce by fueling recommendation systems, enabling personalized marketing strategies, and facilitating accurate demand forecasting.
In the realm of cybersecurity, data science plays a pivotal role in identifying anomalies, recognizing patterns, and bolstering overall threat detection and prevention measures.
In manufacturing and supply chain management, data science is instrumental in optimizing production processes, predicting demand, and enhancing overall logistics efficiency.
The DataMites™ Certified Data Scientist course provides comprehensive training in programming, statistics, machine learning, and business knowledge. Emphasizing Python as the primary language (with optional use of R), the course lays a strong foundation in data science. Completion leads to an IABAC® certificate, preparing individuals for careers as data science professionals.
While a statistical background can be beneficial, it is not always a mandatory requirement for a data science career in Phnom Penh. Proficiency in relevant tools and programming languages, together with practical problem-solving skills, often takes precedence in the field.
DataMites offers a variety of certifications in Phnom Penh, including a Diploma in Data Science, Certified Data Scientist, Data Science for Managers, Data Science Associate, Statistics for Data Science, Python for Data Science, and specialized courses in Operations, Marketing, HR, Finance, among others.
For beginners in Phnom Penh, available courses include Certified Data Scientist, Data Science Foundation, and Diploma in Data Science, providing foundational knowledge to kickstart a career in data science.
In Phnom Penh, DataMites provides specialized courses tailored for professionals, covering Statistics for Data Science, Data Science with R Programming, Python for Data Science, Data Science Associate, and certifications in Operations, Marketing, HR, and Finance.
The data science course offered by DataMites in Phnom Penh has a duration of 8 months, ensuring a thorough learning experience.
Career mentoring sessions at DataMites are interactive, offering personalized guidance on resume building, interview preparation, and career strategies. Participants gain valuable insights and tactics to enhance their professional journey in the field of data science.
Upon completing the training, participants receive the prestigious IABAC Certification from DataMites. This globally recognized certification validates their proficiency in data science concepts and applications, enhancing credibility in the industry.
To excel in Certified Data Scientist Training in Phnom Penh, a strong foundation in mathematics, statistics, and programming is essential. Candidates should possess analytical skills, proficiency in either Python or R, and hands-on experience with extensive datasets and tools like Hadoop or SQL databases.
Opting for online data science training in Phnom Penh from DataMites offers benefits such as self-paced learning, accessibility from any location, a curriculum aligned with industry requirements, guidance from experienced instructors, and engaging learning experiences through interactive features.
The pricing for data science training in Phnom Penh with DataMites varies, ranging from KHR 2,156,268 to KHR 5,391,285.
DataMites' Data Scientist Course in Phnom Penh incorporates practical learning through over 10 capstone projects, including a dedicated client/live project for real-world application and exposure to industry practices.
Instructors at DataMites undergo selection based on certifications, substantial industry experience, and demonstrated expertise in the subject matter. The data science training sessions are led by these qualified and experienced professionals.
DataMites provides flexible learning options, including Live Online sessions and self-study, allowing participants to choose the method that aligns with their preferences and learning styles.
The Flexi-Pass feature in DataMites' Certified Data Scientist Course enables participants to join multiple batches for a comprehensive learning experience. This allows them to revisit topics, clarify doubts, and deepen their understanding across various sessions, contributing to a more thorough grasp of the material.
Yes, upon the completion of the Data Science Course in Phnom Penh, DataMites issues a Certificate of Completion, affirming participants' competence in data science.
Participants are required to bring a valid Photo ID Proof, such as a National ID card or Driving License, to obtain a Participation Certificate and schedule the certification exam if necessary.
In case of a missed session in the DataMites Certified Data Scientist Course in Phnom Penh, participants can access recorded sessions or engage in support sessions to catch up on missed content and address any queries.
Certainly, prospective participants at DataMites have the option to attend a demo class before enrolling in the Certified Data Scientist Course in Phnom Penh. This allows them to evaluate the teaching style, course content, and overall structure before committing.
DataMites incorporates internships into its certified data scientist course in Phnom Penh, delivering a distinctive learning experience that blends theoretical knowledge with practical industry exposure.
Upon successful completion of the Data Science training, participants receive an internationally recognized IABAC® certification. This certification validates their expertise in the field and enhances their employability on a global scale.
The DataMites Placement Assistance Team (PAT) helps aspirants take all the necessary steps to start their career in Data Science. Some of the services provided by PAT are:
The DataMites Placement Assistance Team (PAT) conducts career mentoring sessions to help aspirants understand the role they will play when they step into the corporate world. Industry experts guide the students through the various possibilities in a Data Science career, which helps them draw a clear picture of the career options available. They also learn about the obstacles they are likely to face as freshers in the field and how to tackle them.
No, PAT does not promise a job, but it helps aspirants build the potential needed to land one. In the long run, aspirants can capitalize on the skills they acquire to build a successful career in Data Science.