시리즈

Data Mining

1.1. Introduction

1. Why data mining? 1.1 Challenge > we are drowning in data, but starving for knowledge the key problem is not collecting data, but extracting meanin

2026년 3월 22일

2.2. Bayesian Decision Theory

1. Pattern Discovery in Data 데이터에서 규칙성을 찾는 것은 고전 역학부터 양자 역학에 이르기까지 과학 발전의 근간이 되어온 중요한 문제이다. 1.1 Pattern Recognition automatically discover regualari

2026년 3월 28일

3.3. Basic Information Theory

Many data mining tasks involve making predictions under uncertainty, such as in classification where the goal is to predict a class label $Y$ from an

2026년 4월 5일

4.4. Decision Tree (1)

a data analysis task learns a model classifier predicts categorical (discreate) class labels loan approaval medical diagnosis spam detection autonomou

2026년 4월 9일

5.4. Decision Tree(2)

1. Decision Tree Pruning why do we need pruning? overfitting complex poor performance on unseen data a very detailed tree memorizes the training dat

2026년 4월 11일

6.5. Linear Model (1)

weighted coordinates are combined to form a 'credit score' the resulting score is then compared to a threshold valuefor input $x = (x_1, ..., x_d)$, a

2026년 4월 12일

7.5. Linear Model (2)

1. Regression 1.1 Definition a statical method to study relationship between $\mathbf{x}$ and y $\mathbf{x}$: covariate / predictor variable / indep

2026년 4월 14일

8.5. Linear Model (3)

$X \\in \\R^{N \\times (d+1)}$rows: inputs $\\mathbf{x}\_n$ as row vectors 각 개별 데이터 벡터$\\mathbf{x}\_n$에 1(bias)이라는 항목을 추가한 뒤, 데이터 행렬 $X$를 만들 때는 개별 벡터들

2026년 4월 20일

9.5. Linear Model (4)

1. Linear Models 1.1 Core of Linear Models signal $s = w^Tx$ combines input variables linearly we have seen two models based on this 입력에 대한 가중치 내적은 각

2026년 5월 13일

10.6. Pattern Mining

frequent patterns they reveal hidden regularities and relationship in data association, correlation, casuality sequential patterns partial periodicity

2026년 5월 21일

11.7. Clustering (1)

Cluster analysis partitions a set of data objects into subsets called clustersObjects within a cluster are similar to each other, while objects in dif

2026년 5월 21일

12.7. Clustering (2)

In many real scenarios, data naturally forms groups at different levels E.g., organization charts or handwriting stylesThe result is a tree structure

약 18시간 전