MJay

WSDM - LGBM GPU Verison 되게 하기 본문

Cloud Computing/Machine Learning

WSDM - LGBM GPU Verison 되게 하기

MJSon 2017. 11. 30. 19:03
Edit

WSDM - LGBM GPU Verison 되게 하기


data type not understood 해결하기

pandas version을 다시 깔아야한다.

1.pip uninstall pandas
2.pip install pandas==0.20.3

bin 의미

from GPU-acceleration for Large-scale Tree Boosting.pdf

One way to find the approximate best split is to test only k split positions, and this can be done efficiently using feature histograms. We first convert continuous feature values into k discrete bins, and then construct a histogram with k bins for each feature. To find the split, we can evaluate (4) only at these k points.

params[‘bin_construct_sample_cnt’]

default 값이 200000 이다.

이것을 줄이면 GPU가 쓰인다

CPU 와 GPU time 비교

같은 parameter 사용 , params[‘bin_construct_sample_cnt’] = 256로 바꿈

GPU Processing Time

CPU Processing Time

CPU가 더 빠름 , auc 정확도는 더 낮게 나옴

%23%23%20WSDM%20-%20LGBM%20GPU%20Verison%20%uB418%uAC8C%20%uD558%uAE30%0A@%28Marxico%29%0A**data%20type%20not%20understood%20%uD574%uACB0%uD558%uAE30**%0A%0A%3Epandas%20version%uC744%20%uB2E4%uC2DC%20%uAE54%uC544%uC57C%uD55C%uB2E4.%0A%0A%60%60%60%0Apip%20uninstall%20pandas%0Apip%20install%20pandas%3D%3D0.20.3%0A%60%60%60%0A%0A**bin%20%uC758%uBBF8**%0A%0A%21%5BAlt%20text%5D%28./1511848357988.png%29%0A%0Afrom%20%5BGPU-acceleration%20for%20Large-scale%20Tree%20Boosting.pdf%5D%28https%3A//arxiv.org/pdf/1706.08359.pdf%29%0A%0A%3EOne%20way%20to%20find%20the%20approximate%20best%20split%20is%20to%20test%20only%20k%20split%20positions%2C%20and%20this%20can%20be%20done%20efficiently%20using%20feature%20histograms.%20We%20first%20convert%20continuous%20feature%20values%20into%20k%20discrete%20bins%2C%20and%20then%20construct%20a%20histogram%20with%20k%20bins%20for%20each%20feature.%20To%20find%20the%20split%2C%20we%20can%20evaluate%20%284%29%20only%20at%20these%20k%20points.%20%0A%0A**params%5B%27bin_construct_sample_cnt%27%5D**%20%0A%0A%3E%20default%20%uAC12%uC774%20200000%20%uC774%uB2E4.%0A%0A%3E%20%uC774%uAC83%uC744%20%uC904%uC774%uBA74%20GPU%uAC00%20%uC4F0%uC778%uB2E4%20%0A%0A%21%5BAlt%20text%5D%28./1511848289690.png%29%0A%0A%0A**CPU%20%uC640%20GPU%20time%20%uBE44%uAD50**%0A%0A%3E%uAC19%uC740%20parameter%20%uC0AC%uC6A9%20%2C%20params%5B%27bin_construct_sample_cnt%27%5D%20%3D%20256%uB85C%20%uBC14%uAFC8%20%0A%0AGPU%20Processing%20Time%0A%21%5BAlt%20text%5D%28./1511848398455.png%29%0A%0ACPU%20Processing%20Time%0A%21%5BAlt%20text%5D%28./1511848452772.png%29%0A%0A%3ECPU%uAC00%20%uB354%20%uBE60%uB984%20%2C%20auc%20%uC815%uD655%uB3C4%uB294%20%uB354%20%uB0AE%uAC8C%20%uB098%uC634%0A%0A