๐Ÿค– Machine Learning

๐Ÿค– Machine Learning

[ML] ๋ถ€์ŠคํŒ…(Boosting)

๋ถ€์ŠคํŒ…(Boosting) `๋ถ€์ŠคํŒ…(Boosting)`์ด๋ž€ ์—ฌ๋Ÿฌ ๊ฐœ์˜ learning ๋ชจ๋ธ์„ ์ˆœ์ฐจ์ ์œผ๋กœ ๊ตฌ์ถ•ํ•˜์—ฌ ์ตœ์ข…์ ์œผ๋กœ ํ•ฉ์น˜๋Š” ๋ฐฉ๋ฒ•์ด๋‹ค. ์—ฌ๊ธฐ์„œ ์‚ฌ์šฉํ•˜๋Š” learning ๋ชจ๋ธ์€ ๋งค์šฐ ๋‹จ์ˆœํ•œ ๋ชจ๋ธ์ด๋‹ค. ์—ฌ๊ธฐ์„œ ๋‹จ์ˆœํ•œ ๋ชจ๋ธ์ด๋ž€ Model that slightly better than chance, ์ฆ‰ ์ด์ง„ ๋ถ„๋ฅ˜์—์„œ ๋ถ„๋ฅ˜ ์„ฑ๋Šฅ์ด 0.5๋ฅผ ์กฐ๊ธˆ ๋„˜๋Š” ์ •๋„์˜ ์ˆ˜์ค€์˜ ๋ชจ๋ธ์„ ๋งํ•œ๋‹ค. ๋ถ€์ŠคํŒ…์€ ๋ชจ๋ธ ๊ตฌ์ถ•์— ์ˆœ์„œ๋ฅผ ๊ณ ๋ คํ•˜๊ธฐ ๋•Œ๋ฌธ์— ๊ฐ ๋‹จ๊ณ„์—์„œ ์ƒˆ๋กœ์šด base learner๋ฅผ ํ•™์Šตํ•˜์—ฌ ์ด์ „ ๋‹จ๊ณ„์˜ base learner์˜ ๋‹จ์ ์„ ๋ณด์™„ํ•˜๋ฉฐ, ๊ฐ ๋‹จ๊ณ„๋ฅผ ๊ฑฐ์น˜๋ฉด์„œ ๋ชจ๋ธ์ด ์ ์ฐจ ๊ฐ•ํ•ด์ง„๋‹ค. ๋ถ€์ŠคํŒ… ๋ชจ๋ธ์˜ ์ข…๋ฅ˜๋กœ๋Š” `AdaBoost`, `GBM`, `XGBoost`, `Light GBM`, `CatBoost` ๋“ฑ์ด ์žˆ๋‹ค. Ada..

๐Ÿค– Machine Learning

[ML] ๋ถˆ๊ท ํ˜• ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•œ ์ƒ˜ํ”Œ๋ง ๊ธฐ๋ฒ•

๋ถˆ๊ท ํ˜• ๋ฐ์ดํ„ฐ๋ž€? ๋ถˆ๊ท ํ˜• ๋ฐ์ดํ„ฐ๋ž€, ์ •์ƒ ๋ฒ”์ฃผ์˜ ๊ด€์ธก์น˜ ์ˆ˜์™€ ์ด์ƒ ๋ฒ”์ฃผ์˜ ๊ด€์ธก์น˜ ์ˆ˜์˜ ์ฐจ์ด๊ฐ€ ํฌ๊ฒŒ ๋‚˜ํƒ€๋‚˜๋Š” ๋ฐ์ดํ„ฐ๋ฅผ ๋งํ•œ๋‹ค. ์ฆ‰, ํด๋ž˜์Šค ๋ณ„ ๊ด€์ธก์น˜์˜ ์ˆ˜๊ฐ€ ํ˜„์ €ํ•˜๊ฒŒ ์ฐจ์ด๊ฐ€ ๋‚˜๋Š” ๋ฐ์ดํ„ฐ๋ฅผ ๋ถˆ๊ท ํ˜• ๋ฐ์ดํ„ฐ๋ผ๊ณ  ๋งํ•œ๋‹ค. ๋ถˆ๊ท ํ˜• ๋ฐ์ดํ„ฐ๊ฐ€ ๋ฌธ์ œ๊ฐ€ ๋˜๋Š” ์ด์œ ๋Š” ๋‹ค์ˆ˜์˜ ๋ฐ์ดํ„ฐ๋ฅผ ์ •ํ™•ํžˆ ๋ถ„๋ฅ˜ํ•˜๋Š” ๊ฒƒ๋ณด๋‹ค ์ผ๋ฐ˜์ ์œผ๋กœ ์†Œ์ˆ˜์˜ ๋ฐ์ดํ„ฐ๋ฅผ ์ •ํ™•ํžˆ ๋ถ„๋ฅ˜ํ•˜๋Š” ๊ฒƒ์ด ์ค‘์š”ํ•˜์ง€๋งŒ ๋‹ค์ˆ˜์˜ ๋ฐ์ดํ„ฐ์— ํŽธํ–ฅ๋œ ๋ถ„๋ฅ˜ ๊ฒฝ๊ณ„์„ ์ด ํ˜•์„ฑ๋˜์–ด ์†Œ์ˆ˜์˜ ๋ฐ์ดํ„ฐ๋ฅผ ์ •ํ™•ํžˆ ์ฐพ์•„๋‚ด์ง€ ๋ชปํ•˜๊ธฐ ๋•Œ๋ฌธ์ด๋‹ค. ๋˜ํ•œ ๋‹ค์Œ๊ณผ ๊ฐ™์€ `์ •์˜คํ–‰๋ ฌ(Confusion Matrix)`๊ฐ€ ์žˆ์„ ๋•Œ, ์ด์ƒ(์†Œ์ˆ˜) ๋ฐ์ดํ„ฐ๋ฅผ ์ œ๋Œ€๋กœ ์˜ˆ์ธกํ•˜์ง€ ๋ชปํ–ˆ์Œ์—๋„ ์˜ˆ์ธก ์ •ํ™•๋„๊ฐ€ ๋†’๊ฒŒ ๋‚˜์˜ค๋Š” ๋ชจ๋ธ ์„ฑ๋Šฅ์— ๋Œ€ํ•œ ์™œ๊ณก์ด ์žˆ์„ ์ˆ˜ ์žˆ๋‹ค. ๋ถˆ๊ท ํ˜• ๋ฐ์ดํ„ฐ์˜ ํ•ด๊ฒฐ ๋ฐฉ์•ˆ ํฌ๊ฒŒ ๋‘ ๊ฐ€์ง€๋กœ, ๋ฐ์ดํ„ฐ๋ฅผ ์กฐ์ •ํ•ด์„œ ์ด๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” ..

๐Ÿค– Machine Learning

[ML] ๊ตฐ์ง‘ ๋ถ„์„(Clustering Analysis)

๊ตฐ์ง‘ํ™”(Clustering) `๊ตฐ์ง‘ํ™”(Clustering)`์ด๋ž€ ์˜ˆ์ธก ๋ชจ๋ธ๊ณผ ๊ฐ™์ด ์˜ˆ์ธก์„ ๋ชฉ์ ์œผ๋กœ ํ•˜๋Š” ๊ฒƒ์ด ์•„๋‹Œ, ๋ฐ์ดํ„ฐ๊ฐ€ ์ฃผ์–ด์กŒ์„ ๋•Œ ๋ฐ์ดํ„ฐ ๊ฐ„์— ์œ ์‚ฌํ•œ ์†์„ฑ๋“ค์„ ๊ฐ–๋Š” ๊ด€์ธก์น˜๋“ค์„ ๋ฌถ์–ด ์ „์ฒด ๋ฐ์ดํ„ฐ๋ฅผ ๋ช‡ ๊ฐœ์˜ ๊ตฐ์ง‘(๊ทธ๋ฃน)์œผ๋กœ ๋‚˜๋ˆ„๋Š” ๊ฒƒ์„ ๋งํ•œ๋‹ค. ๊ตฐ์ง‘ํ™”์˜ ๊ธฐ์ค€์€ ๋™์ผํ•œ ๊ตฐ์ง‘์— ์†Œ์†๋œ ๊ด€์ธก์น˜๋“ค์€ ์„œ๋กœ ์œ ์‚ฌํ• ์ˆ˜๋ก ์ข‹์œผ๋ฉฐ, ์ƒ์ดํ•œ ๊ตฐ์ง‘์— ์†Œ์†๋œ ๊ด€์ธก์น˜๋“ค์€ ์„œ๋กœ ๋‹ค๋ฅผ์ˆ˜๋ก ์ข‹๋‹ค. `๋ถ„๋ฅ˜(Classification)`์™€ `๊ตฐ์ง‘ํ™”(Clustering)`์˜ ์ฐจ์ด๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™๋‹ค. ๋ถ„๋ฅ˜ : ์‚ฌ์ „ ์ •์˜๋œ ๋ฒ”์ฃผ๊ฐ€ ์žˆ๋Š”(labeled) ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ์˜ˆ์ธก ๋ชจ๋ธ์„ ํ•™์Šตํ•˜๋Š” ๋ฌธ์ œ → `์ง€๋„ํ•™์Šต(Supervised Learning)` ๊ตฐ์ง‘ํ™” : ์‚ฌ์ „ ์ •์˜๋œ ๋ฒ”์ฃผ๊ฐ€ ์—†๋Š”(unlabeled) ๋ฐ์ดํ„ฐ์—์„œ ์ตœ์ ์˜ ๊ทธ๋ฃน์„ ์ฐพ๋Š” ๋ฌธ์ œ → `๋น„..

๐Ÿค– Machine Learning

[ML] ๋ถ€๋ถ„์ตœ์†Œ์ œ๊ณฑ๋ฒ•(Partial Least Squares, PLS)

๋ถ€๋ถ„์ตœ์†Œ์ œ๊ณฑ๋ฒ•(Partial Least Squares, PLS) `๋ถ€๋ถ„์ตœ์†Œ์ œ๊ณฑ๋ฒ•(Partial Least Squares, PLS)`์€ X์˜ ์„ ํ˜•๊ฒฐํ•ฉ์˜ ๋ถ„์‚ฐ์„ ์ตœ๋Œ€ํ™”ํ•˜๋Š” ๊ฒƒ๊ณผ ๋”๋ถˆ์–ด X์˜ ์„ ํ˜•๊ฒฐํ•ฉ Z์™€ Y๊ฐ„์˜ ๊ณต๋ถ„์‚ฐ์„ ์ตœ๋Œ€ํ™”ํ•˜๋Š” ๋ณ€์ˆ˜๋ฅผ ์ถ”์ถœํ•˜๋Š” ๋ฐฉ๋ฒ•์ด๋‹ค. PLS๋Š” Y์™€์˜ ๊ณต๋ถ„์‚ฐ์ด ๋†’์€ k๊ฐœ์˜ ์„ ํ˜•์กฐํ•ฉ ๋ณ€์ˆ˜๋ฅผ ์ถ”์ถœ(Supervised feature extraction)ํ•˜๋Š” ๋ฐฉ์‹์ด๋‹ค. PLS(๋ถ€๋ถ„์ตœ์†Œ์ œ๊ณฑ)์˜ ์šฉ์–ด๋Š” ์„ ํ˜•์กฐํ•ฉ์œผ๋กœ ์ถ”์ถœ๋œ ๋ณ€์ˆ˜๊ฐ€ ์„ค๋ช…ํ•˜์ง€ ๋ชปํ•˜๋Š” ๋ถ€๋ถ„์—(๋ฐ์ดํ„ฐ ์ผ๋ถ€๋ถ„) ์ง€์†์ ์œผ๋กœ ์ตœ์†Œ์ œ๊ณฑ๋ฒ•์„ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์—์„œ ์œ ๋ž˜ํ–ˆ๋‹ค. PLS์˜ ์ฃผ์š” ๋ชฉ์ ์€ PCA์™€ ๋™์ผํ•˜๊ฒŒ ํšŒ๊ท€ ๋ฐ ๋ถ„๋ฅ˜ ๋ชจ๋ธ์„ ๊ตฌ์ถ•ํ•˜๊ณ , ๋ฐ์ดํ„ฐ์˜ ์ฐจ์›์„ ์ถ•์†Œ์‹œํ‚ค๋Š” ๊ฒƒ์ด๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ PCA์™€์˜ ์ฐจ์ด์ ์€ ์ถ”์ถœ๋œ ๋ณ€์ˆ˜๊ฐ€ PCA์—์„œ๋Š” ๋ฐ˜์˜ํ•˜์ง€ ๋ชปํ–ˆ๋˜ Y์™€์˜..

๐Ÿค– Machine Learning

[ML] Principal Component Analysis(PCA, ์ฃผ์„ฑ๋ถ„ ๋ถ„์„)

๊ณ ์ฐจ์› ๋ฐ์ดํ„ฐ ๊ณ ์ฐจ์› ๋ฐ์ดํ„ฐ๋ž€ X ๋ณ€์ˆ˜์˜ ์ˆ˜๊ฐ€ ๋งŽ์€ ๋ฐ์ดํ„ฐ๋ฅผ ๋งํ•œ๋‹ค. ์ด๋Š” ๋ณ€์ˆ˜์˜ ์ˆ˜๊ฐ€ ๋งŽ๊ธฐ ๋•Œ๋ฌธ์— ๋ถˆํ•„์š”ํ•œ ๋ณ€์ˆ˜๊ฐ€ ์กด์žฌํ•˜๋ฉฐ ์‹œ๊ฐ์ ์œผ๋กœ ํ‘œํ˜„ํ•˜๊ธฐ ์–ด๋ ต๋‹ค. ๋˜ํ•œ ๊ณ„์‚ฐ ๋ณต์žก๋„๊ฐ€ ์ฆ๊ฐ€ํ•˜๊ธฐ ๋•Œ๋ฌธ์— ๋ชจ๋ธ๋ง์ด ๋น„ํšจ์œจ์ ์ผ ๊ฐ€๋Šฅ์„ฑ์ด ํฌ๋‹ค. ๋”ฐ๋ผ์„œ ์ด ๊ฒฝ์šฐ ์ค‘์š”ํ•œ ๋ณ€์ˆ˜๋งŒ์„ ์„ ํƒํ•ด์„œ ๋ชจ๋ธ๋ง์„ ํ•˜๋Š” ๊ฒƒ์ด ํ•„์š”ํ•˜๋ฉฐ, ์ด๋ฅผ `์ฐจ์› ์ถ•์†Œ(dimension reduction)`๋ผ๊ณ  ํ•œ๋‹ค. ๋ณ€์ˆ˜ ์„ ํƒ/์ถ”์ถœ์„ ํ†ตํ•œ ์ฐจ์› ์ถ•์†Œ ์ฐจ์› ์ถ•์†Œ์˜ ๋ฐฉ๋ฒ•์€ `๋ณ€์ˆ˜ ์„ ํƒ(feature selection)`๊ณผ `๋ณ€์ˆ˜ ์ถ”์ถœ(feature extraction)` ๋‘ ๊ฐ€์ง€๊ฐ€ ์žˆ๋‹ค. ๋ณ€์ˆ˜ ์„ ํƒ์ด๋ž€ ๋ถ„์„ ๋ชฉ์ ์— ๋ถ€ํ•ฉํ•˜๋Š” ์†Œ์ˆ˜์˜ ์˜ˆ์ธก ๋ณ€์ˆ˜๋งŒ์„ ์„ ํƒํ•˜๋Š” ๋ฐฉ๋ฒ•์œผ๋กœ ๋ณธ ๋ฐ์ดํ„ฐ์—์„œ ๋ณ€์ˆ˜๋ฅผ ์„ ํƒํ•˜๊ธฐ ๋•Œ๋ฌธ์— ์„ ํƒํ•œ ๋ณ€์ˆ˜์˜ ํ•ด์„์ด ์šฉ์ดํ•˜์ง€๋งŒ ๋ณ€์ˆ˜๊ฐ„ ์ƒ๊ด€๊ด€๊ณ„๋ฅผ ๊ณ ๋ คํ•˜๊ธฐ ์–ด๋ ต..

๐Ÿค– Machine Learning

[ML] Support Vector Machine(SVM) - Margin, Hard Margin Lienar SVM

์„œํฌํŠธ ๋ฒกํ„ฐ ๋จธ์‹ (Support Vector Machine, SVM) `์„œํฌํŠธ ๋ฒกํ„ฐ ๋จธ์‹ (Support Vector Machine, SVM)`์€ ๊ณ ์ฐจ์› ๋ฐ์ดํ„ฐ์˜ ๋ถ„๋ฅ˜ ๋ฌธ์ œ์— ๊ต‰์žฅํ•œ ์„ฑ๋Šฅ์„ ๋ณด์ด๋Š” ๋ชจ๋ธ์ด๋‹ค. ๋˜ํ•œ SVM์€ 2์ฐจ ๋ฐฉ์ •์‹์œผ๋กœ formulation์„ ํ•œ๋‹ค. ์ฆ‰, `2์ฐจ ๊ณ„ํš๋ฒ•(quadratic programming)`์œผ๋กœ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•œ๋‹ค. ๋Œ€๋ถ€๋ถ„ ๋ฌธ์ œ๋ฅผ ํ’€๋•Œ training data์— ๋Œ€ํ•˜์—ฌ ๋ถ„๋ฅ˜ ์„ฑ๋Šฅ์„ ์ตœ๋Œ€ํ™”ํ•˜๊ธฐ ์œ„ํ•ด ๋…ธ๋ ฅํ•œ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ training data์— ๋„ˆ๋ฌด ๋งž์ถฐ์ง„ ๋ชจ๋ธ์€ ๊ณผ์ ํ•ฉ์˜ ์œ„ํ—˜์ด ์žˆ์œผ๋ฉฐ, ์•Œ๋ ค์ง€์ง€ ์•Š์€ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ ์„ฑ๋Šฅ(generalization ability)๋„ ์ข‹์•„์•ผ ํ•œ๋‹ค. ์—ฌ๊ธฐ์„œ generalization ability์™€ fitting to the training da..

๐Ÿค– Machine Learning

[ML] ๋žœ๋ค ํฌ๋ ˆ์ŠคํŠธ(Random Forest) ๋ชจ๋ธ

๋žœ๋ค ํฌ๋ ˆ์ŠคํŠธ ๋ชจ๋ธ ๋ฐฐ๊ฒฝ - ์•™์ƒ๋ธ” `๋žœ๋ค ํฌ๋ ˆ์ŠคํŠธ(Random Forest)`๋Š” ์•™์ƒ๋ธ” ๊ธฐ๋ฒ•์˜ ํ•˜๋‚˜์˜ ์˜ˆ์ด๋‹ค. `์•™์ƒ๋ธ”(ensemble)`์ด๋ž€, ์—ฌ๋Ÿฌ Base ๋ชจ๋ธ๋“ค์˜ ์˜ˆ์ธก์„ ๋‹ค์ˆ˜๊ฒฐ ๋ฒ•์น™ ๋˜๋Š” ํ‰๊ท ์„ ์ด์šฉํ•ด ํ†ตํ•ฉํ•˜์—ฌ ์˜ˆ์ธก ์ •ํ™•์„ฑ์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ๋ฐฉ๋ฒ•์„ ๋งํ•œ๋‹ค. ์•™์ƒ๋ธ” ๋ชจ๋ธ์€ Base ๋ชจ๋ธ๋“ค์ด ์„œ๋กœ ๋…๋ฆฝ์ ์ด๋ฉฐ, Base ๋ชจ๋ธ๋“ค์ด ๋ฌด์ž‘์œ„ ์˜ˆ์ธก์„ ์ˆ˜ํ–‰ํ•˜๋Š” ๋ชจ๋ธ๋ณด๋‹ค ์„ฑ๋Šฅ์ด ์ข‹์€ ๊ฒฝ์šฐ Base ๋ชจ๋ธ๋ชจ๋‹ค ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ค€๋‹ค. ์•™์ƒ๋ธ” ๋ชจ๋ธ์˜ ์˜ค๋ฅ˜์œจ์€ ๋‹ค์Œ๊ณผ ๊ฐ™์€ ์‹์œผ๋กœ ๋‚˜ํƒ€๋‚ธ๋‹ค. ๋žœ๋ค ํฌ๋ ˆ์ŠคํŠธ ๋ชจ๋ธ์€ `์˜์‚ฌ๊ฒฐ์ •๋‚˜๋ฌด(decision tree)๋ชจ๋ธ`์„ Base ๋ชจ๋ธ๋กœ ์‚ฌ์šฉํ•œ๋‹ค. ์˜์‚ฌ๊ฒฐ์ •๋‚˜๋ฌด๋ชจ๋ธ์€ ๋‹ค์Œ๊ณผ ๊ฐ™์€ ์ด์œ ๋กœ Base ๋ชจ๋ธ๋กœ์จ ํ™œ์šฉ๋„๊ฐ€ ๋†’๋‹ค. Low computational complexity : ๋ฐ์ดํ„ฐ์˜ ํฌ๊ธฐ๊ฐ€..

๐Ÿค– Machine Learning

[ML] ์˜์‚ฌ๊ฒฐ์ •๋‚˜๋ฌด ๋ชจ๋ธ(Decision Tree Model)

์˜์‚ฌ๊ฒฐ์ •๋‚˜๋ฌด ๋ชจ๋ธ ๋ฐ์ดํ„ฐ์— ๋‚ด์žฌ๋˜์–ด ์žˆ๋Š” ํŒจํ„ด์„ ๋ณ€์ˆ˜์˜ ์กฐํ•ฉ์œผ๋กœ ๋‚˜ํƒ€๋‚ด๋Š” ์˜ˆ์ธก/๋ถ„๋ฅ˜ ๋ชจ๋ธ์„ ๋‚˜๋ฌด์˜ ํ˜•ํƒœ๋กœ ๋งŒ๋“œ๋Š” ๊ฒƒ ์งˆ๋ฌธ์„ ๋˜์ ธ์„œ ๋งž๊ณ  ํ‹€๋ฆฌ๋Š” ๊ฒƒ์— ๋”ฐ๋ผ ์šฐ๋ฆฌ๊ฐ€ ์ƒ๊ฐํ•˜๊ณ  ์žˆ๋Š” ๋Œ€์ƒ์„ ์ขํ˜€๋‚˜๊ฐ ์Šค๋ฌด๊ณ ๊ฐœ ๋†€์ด์™€ ๋น„์Šทํ•œ ๊ฐœ๋… ๋ฐ์ดํ„ฐ๋ฅผ 2๊ฐœ ํ˜น์€ ๊ทธ ์ด์ƒ์˜ ๋ถ€๋ถ„์ง‘ํ•ฉ์œผ๋กœ ๋ฐ์ดํ„ฐ๊ฐ€ ๊ท ์ผํ•ด์ง€๋„๋ก ๋ถ„ํ•  ๋ถ„๋ฅ˜ : ๋น„์Šทํ•œ ๋ฒ”์ฃผ๋ฅผ ๊ฐ–๊ณ  ์žˆ๋Š” ๊ด€์ธก์น˜๋ผ๋ฆฌ ๋ชจ์Œ ์˜ˆ์ธก : ๋น„์Šทํ•œ ์ˆ˜์น˜๋ฅผ ๊ฐ–๊ณ  ์žˆ๋Š” ๊ด€์ธก์น˜๋ผ๋ฆฌ ๋ชจ์Œ `๋ฟŒ๋ฆฌ ๋งˆ๋””(Root node)`, `์ค‘๊ฐ„๋งˆ๋””(Intermediate node)`, `๋๋งˆ๋””(Terminal node)`๋กœ ๊ตฌ์„ฑ ์˜ˆ์ธก๋‚˜๋ฌด ๋ชจ๋ธ(Regression Tree) ์˜ˆ์ธก๋‚˜๋ฌด ๋ชจ๋ธ์— ๋Œ€ํ•˜์—ฌ ๋‹ค์Œ 3๊ฐ€์ง€ ๋ฐฉ๋ฒ•์€ ๊ฐ™์€ ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฅด๊ฒŒ ํ‘œํ˜„ํ•œ ๊ฒƒ์ด๋‹ค. ์˜ˆ์ธก๋‚˜๋ฌด ๋ชจ๋ธ๋ง ํ”„๋กœ์„ธ์Šค๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™๋‹ค. ์ด์ œ ๋ถ„ํ• ๋ณ€์ˆ˜(j)์™€ ๋ถ„ํ• ์ (..

๐Ÿค– Machine Learning

[ML] K-nearest neighbors & Distance Measures

๋ถ„๋ฅ˜ ๋ฐ ์˜ˆ์ธก์„ ์œ„ํ•œ ๋ชจ๋ธ ๋ถ„๋ฅ˜ ๋ฐ ์˜ˆ์ธก์„ ์œ„ํ•œ ๋ชจ๋ธ์€ ํฌ๊ฒŒ 2๊ฐ€์ง€๊ฐ€ ์žˆ๋‹ค. ์ฒซ ๋ฒˆ์งธ๋Š” `Model-based Learning`์ด๋‹ค. ์ด๋Š” ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ๋ชจ๋ธ์„ ์ƒ์„ฑํ•˜์—ฌ ๋ถ„๋ฅ˜ ๋ฐ ์˜ˆ์ธก์„ ์ง„ํ–‰ํ•˜๋Š” ๊ฒƒ์œผ๋กœ ์„ ํ˜•/๋น„์„ ํ˜•๋ชจ๋ธ(linear regression, logistic regression ๋“ฑ), Neural Network, ์˜์‚ฌ ๊ฒฐ์ • ๋‚˜๋ฌด, Support Vector Machine ๋“ฑ์„ ์˜ˆ๋กœ ๋“ค ์ˆ˜ ์žˆ๋‹ค. ๋‘ ๋ฒˆ์งธ๋Š” `Instance-based Learning`์ด๋‹ค. ์ด๋Š” ๋ณ„๋„์˜ ๋ชจ๋ธ ์ƒ์„ฑ ์—†์ด ์ธ์ ‘ ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„๋ฅ˜ ๋ฐ ์˜ˆ์ธก์— ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์„ ๋งํ•œ๋‹ค. ๋Œ€ํ‘œ์ ์ธ ์˜ˆ๋กœ๋Š” K-nearest neighbor, Locally weighted regression ๋“ฑ์ด ์žˆ๋‹ค. Nearest neighbors KNN ์•Œ๊ณ ๋ฆฌ์ฆ˜..

๐Ÿค– Machine Learning

[ML] ์ •๊ทœํ™” ๋ชจ๋ธ - Regularization ๊ฐœ๋…, Ridge, LASSO, Elastic Net

Good Model ์ข‹์€ ๋ชจ๋ธ์ด๋ž€ ๋‘ ๊ฐ€์ง€๋กœ ๋‚˜๋ˆŒ ์ˆ˜ ์žˆ๋‹ค. ์ฒซ ๋ฒˆ์งธ๋Š” ํ˜„์žฌ ๋ฐ์ดํ„ฐ(training data)๋ฅผ ์ž˜ ์„ค๋ช…ํ•˜๋Š” ๋ชจ๋ธ(Explanatory modeling)์ด๋ฉฐ, ๋‘ ๋ฒˆ์งธ๋Š” ๋ฏธ๋ž˜ ๋ฐ์ดํ„ฐ(testing data)์— ๋Œ€ํ•œ ์˜ˆ์ธก ์„ฑ๋Šฅ์ด ์ข‹์€ ๋ชจ๋ธ(Predictive modeling)์ด๋‹ค. ์ข‹์€ Explanatory model์ด๋ž€ training error๋ฅผ minimizeํ•˜๋Š” ๋ชจ๋ธ์ด๋‹ค. ํƒ€๊นƒ ๊ฐ’์ด ์—ฐ์†ํ˜•์ผ ๊ฒฝ์šฐ MSE ๊ฐ’์„ minimize ํ•˜๋Š” ๊ฒฝ์šฐ์ด๋‹ค. ๋˜ํ•œ ์ข‹์€ Predictive Model์ด๋ž€ ๋ฏธ๋ž˜ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ expected error๊ฐ€ ๋‚ฎ์€ ๋ชจ๋ธ์ด๋‹ค. ์ด ๊ฒฝ์šฐ์—๋Š” Expected MSE๊ฐ’์„ ๊ฐ์†Œ์‹œ์ผœ์•ผ ํ•œ๋‹ค. ์—ฌ๊ธฐ์„œ Irreducible Error๋Š” ์–ด๋–ป๊ฒŒ ํ•  ๋ฐฉ๋ฒ•์ด ์—†๋Š” ์˜ค๋ฅ˜๋ฅผ ๋งํ•˜๋ฉฐ,..

Junyeong Son
'๐Ÿค– Machine Learning' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๊ธ€ ๋ชฉ๋ก