[Data Science] 2. Data Science and Machine Learning

유(YOO) 2023. 8. 14. 16:56

Link

https://app.datascientist.fr/learn/learning/57/60/166/762

DataScientist.fr: The most interactive platform for learning data science, artificial intelligence, and the cloud

CRISP-DM Process

1. Opportunity Assessment & Business Understanding
2. Data Understanding & Acquisition
3. Data Preparation & Cleaning & Transformation
4. Modeling
5. Evaluation & Residuals & Metrics
6. Model Deployment & Application

Data Preparation

1. Data Collection
- Data augmentation : Create extra training samples from existing ones, e.g. rotating the original images, cropping them differently, or altering the lighting conditions
- Data labeling
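
The augmentation ideas above (rotating, cropping, changing the lighting) can be sketched on a tiny image. This is only an illustration in plain Python: the pixel values and the helper names `rotate_90`, `crop`, and `brighten` are made up for this note, and real projects would normally use an image library.

```python
# A tiny grayscale "image" as a nested list of pixel intensities (0-255).
# The values are invented for illustration.
image = [
    [10, 20, 30],
    [40, 50, 60],
    [70, 80, 90],
]

def rotate_90(img):
    """Rotate the image 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]

def crop(img, top, left, height, width):
    """Keep a height x width window starting at row `top`, column `left`."""
    return [row[left:left + width] for row in img[top:top + height]]

def brighten(img, delta):
    """Shift every pixel by `delta`, clamped to the 0-255 range."""
    return [[max(0, min(255, p + delta)) for p in row] for row in img]

# One original image becomes three additional training samples.
rotated = rotate_90(image)
cropped = crop(image, 0, 0, 2, 2)
brighter = brighten(image, 200)
```

Each variant keeps the same label as the original image, which is why augmentation is a cheap way to grow a labeled dataset.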

2. Data Processing
- Formatting
- Cleaning : Remove or fix messy data (missing, repeated, or erroneous values)
- Sampling : Work on a representative subset if you have too much data

3. Data Transformation(Feature engineering)
- Scaling
- Normalizing
- Decomposition
- Feature aggregation : RGB, Channels
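
A minimal sketch of two of the transformations listed above. The notes do not define "scaling" and "normalizing" precisely, so this assumes one common reading: min-max scaling to the range [0, 1] and z-score standardization. The function names and the sample heights are hypothetical.

```python
import statistics

def min_max_scale(values):
    """Rescale values linearly so the smallest becomes 0 and the largest 1."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

def standardize(values):
    """Shift and rescale values to mean 0 and (population) standard deviation 1."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    return [(v - mean) / std for v in values]

heights_cm = [150.0, 160.0, 170.0, 180.0, 190.0]  # invented sample feature
scaled = min_max_scale(heights_cm)
z_scores = standardize(heights_cm)
```

Both helpers assume the values are not all identical; otherwise the denominator would be zero.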

* Missing & Repeated values
* Outliers & Errors
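
One possible way to handle the issues listed above, sketched with the standard library only. The rows are invented, and the choices made here (median imputation, the 1.5 x IQR rule) are common conventions assumed for the example, not something the course notes prescribe.

```python
import statistics

# Made-up (sensor_id, temperature) rows with one gap, one repeat, and one spike.
rows = [(1, 21.5), (2, 22.0), (3, None), (2, 22.0), (4, 21.8), (5, 95.0), (6, 22.3)]

# Repeated values: drop exact duplicate rows, keeping the first occurrence.
unique_rows = list(dict.fromkeys(rows))

# Missing values: fill each gap with the median of the observed temperatures.
observed = [t for _, t in unique_rows if t is not None]
median = statistics.median(observed)
filled = [(i, median if t is None else t) for i, t in unique_rows]

# Outliers: flag values more than 1.5 * IQR outside the quartiles.
temps = [t for _, t in filled]
q1, _, q3 = statistics.quantiles(temps, n=4)
iqr = q3 - q1
outliers = [t for t in temps if t < q1 - 1.5 * iqr or t > q3 + 1.5 * iqr]
```

Whether a flagged value is a genuine error or a rare but valid observation still has to be decided with domain knowledge.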

Machine Learning

Supervised Learning

1. Classification : Predict a category (the simplest case is a Yes/No question)
ex) Will it be hot or cold tomorrow?
- Evaluation of Classification
    + Confusion Matrix
        * Recall = TP/(TP+FN)
        * Precision = TP/(TP+FP)
        * Accuracy = (TP+TN)/(TP+TN+FP+FN)
- Types
    + Binary Classification
    + Multiclass Classification
    + Multilabel Classification
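
The confusion-matrix formulas above can be checked by hand on a small binary example. The labels below are made up (1 = "hot", 0 = "cold"); the variable names are only for illustration.

```python
# Invented ground truth and predictions for ten days.
y_true = [1, 0, 1, 1, 0, 0, 1, 0, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0, 0, 1]

pairs = list(zip(y_true, y_pred))
tp = sum(t == 1 and p == 1 for t, p in pairs)  # predicted hot, was hot
tn = sum(t == 0 and p == 0 for t, p in pairs)  # predicted cold, was cold
fp = sum(t == 0 and p == 1 for t, p in pairs)  # predicted hot, was cold
fn = sum(t == 1 and p == 0 for t, p in pairs)  # predicted cold, was hot

recall = tp / (tp + fn)                     # share of real positives that were found
precision = tp / (tp + fp)                  # share of predicted positives that were right
accuracy = (tp + tn) / (tp + tn + fp + fn)  # share of all predictions that were right
```

Here the three metrics differ (0.75, 0.6, 0.7), which shows why accuracy alone can hide how a classifier treats the positive class.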

2. Regression : Predict a numerical value
ex) What will the temperature be tomorrow?
- Evaluation of Regression
    + MSE (Mean Squared Error)
    + RMSE (Root Mean Squared Error)
    + MAE (Mean Absolute Error)
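
A short sketch of how the three regression metrics above are computed, using made-up actual and predicted temperatures.

```python
import math

actual = [20.0, 22.0, 19.0, 25.0]      # invented observed temperatures
predicted = [21.0, 20.0, 19.0, 28.0]   # invented model outputs

# Residuals: predicted minus actual, here [1.0, -2.0, 0.0, 3.0].
errors = [p - a for a, p in zip(actual, predicted)]

mse = sum(e ** 2 for e in errors) / len(errors)   # mean of squared errors
rmse = math.sqrt(mse)                             # same unit as the target
mae = sum(abs(e) for e in errors) / len(errors)   # mean of absolute errors
```

MSE and RMSE penalize the single 3-degree miss more heavily than MAE does, because the errors are squared before averaging.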

Unsupervised Learning

1. Clustering : Group observations into similar-looking groups
- Evaluation of Clustering
    + Internal Measures
        * Cohesion
        * Separation
    + External Measures
        * Compare with Ground Truth
2. Recommender system : Suggest items a user is likely to want, based on past behavior
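
For the internal clustering measures listed under Clustering, one common formulation (assumed here, since the notes do not give formulas) is the within-cluster sum of squares for cohesion and the between-cluster sum of squares for separation. The points and cluster names are invented.

```python
def centroid(points):
    """Coordinate-wise mean of a list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))

def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

# Two invented clusters of 2-D points.
clusters = {
    "a": [(1.0, 1.0), (1.0, 3.0)],
    "b": [(9.0, 1.0), (9.0, 3.0)],
}
all_points = [p for pts in clusters.values() for p in pts]
overall = centroid(all_points)

# Cohesion: how close points are to their own cluster centroid (lower is tighter).
cohesion = sum(sq_dist(p, centroid(pts)) for pts in clusters.values() for p in pts)

# Separation: how far cluster centroids are from the overall mean (higher is better).
separation = sum(len(pts) * sq_dist(centroid(pts), overall) for pts in clusters.values())
```

With these definitions, cohesion plus separation equals the total sum of squares around the overall mean, so improving one necessarily improves the other for a fixed dataset.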

Dataset

1. Training Dataset : The sample of data used to fit the model
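2. Validation Dataset : The sample of data used to tune the model and compare candidates during training
- Cross Validation : Rotate which part of the training data is held out for validation
3. Test Dataset : The sample of data held back for the final evaluation of the fitted model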
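
A minimal sketch of splitting data into the three sets above and of generating k-fold cross-validation indices. The 60/20/20 proportions, the helper names, and the fixed seed are assumptions made for the example.

```python
import random

def train_val_test_split(data, val_frac=0.2, test_frac=0.2, seed=0):
    """Shuffle a copy of the data and cut it into train, validation, and test parts."""
    items = list(data)
    random.Random(seed).shuffle(items)
    n_test = int(len(items) * test_frac)
    n_val = int(len(items) * val_frac)
    test = items[:n_test]
    val = items[n_test:n_test + n_val]
    train = items[n_test + n_val:]
    return train, val, test

def k_fold_indices(n, k):
    """Yield (train_idx, val_idx) pairs; each index is used for validation exactly once."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val_idx = list(range(start, start + size))
        train_idx = [i for i in range(n) if i < start or i >= start + size]
        yield train_idx, val_idx
        start += size

samples = list(range(10))  # stand-in for ten observations
train, val, test = train_val_test_split(samples)
folds = list(k_fold_indices(len(train), k=3))
```

In cross-validation the model is fitted once per fold and the validation scores are averaged, while the test set stays untouched until the very end.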

Overfitting & Underfitting

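1. Overfitting : Force-fitting the training data; results look too good to be true
2. Appropriate fitting
3. Underfitting : Too simple to explain the variance

- Model complexity : Too much leads to overfitting, too little to underfitting
- Training Error < Test Error : A large gap is a sign of overfitting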
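
The effect of model complexity can be seen in a small experiment. This sketch uses k-nearest-neighbour regression as a stand-in model, which the notes do not mention: k=1 memorizes the training set (most complex), while k equal to the training-set size always predicts the overall mean (simplest). The data is synthetic, generated from an assumed line y = 2x plus noise.

```python
import random

rng = random.Random(42)  # fixed seed so the run is repeatable

def make_data(n):
    """Synthetic samples: y = 2x plus Gaussian noise (an assumption for this demo)."""
    xs = [rng.uniform(0, 10) for _ in range(n)]
    return [(x, 2.0 * x + rng.gauss(0, 2.0)) for x in xs]

train = make_data(30)
test = make_data(30)

def knn_predict(x, data, k):
    """Average the targets of the k training points whose x is closest."""
    nearest = sorted(data, key=lambda pt: abs(pt[0] - x))[:k]
    return sum(y for _, y in nearest) / k

def mse(data, k):
    """Mean squared error of the k-NN model (fitted on `train`) over `data`."""
    return sum((y - knn_predict(x, train, k)) ** 2 for x, y in data) / len(data)

for k in (1, 5, len(train)):
    print(f"k={k:2d}  train MSE={mse(train, k):7.2f}  test MSE={mse(test, k):7.2f}")
```

With k=1 the training error is zero while the test error is not, which is the overfitting pattern; with k=30 both errors are high, which is the underfitting pattern.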
