- Building Machine Learning Systems with Python
- Willi Richert Luis Pedro Coelho
- 148字
- 2021-08-13 16:35:47
Tweaking the parameters
So what about all the other parameters? Can we tweak them all to get better results?
Sure. We could, of course, tweak the number of clusters or play with the vectorizer's max_features
parameter (you should try that!). Also, we could play with different cluster center initializations. There are also more exciting alternatives to KMeans itself. There are, for example, clustering approaches that also let you use different similarity measurements such as Cosine similarity, Pearson, or Jaccard. An exciting field for you to play.
But before you go there, you will have to define what you actually mean by "better". Scikit has a complete package dedicated only to this definition. The package is called sklearn.metrics
and also contains a full range of different metrics to measure clustering quality. Maybe that should be the first place to go now, right into the sources of the metrics package.
- Mastering Entity Framework Core 2.0
- Python入門很簡單
- Building a Home Security System with Raspberry Pi
- Learning Raspbian
- C#程序設計教程(第3版)
- iPhone應用開發從入門到精通
- R用戶Python學習指南:數據科學方法
- Webpack實戰:入門、進階與調優(第2版)
- 智能手機故障檢測與維修從入門到精通
- 零代碼實戰:企業級應用搭建與案例詳解
- Laravel Application Development Blueprints
- Distributed Computing in Java 9
- Python商務數據分析(微課版)
- Java并發編程:核心方法與框架
- Android Game Programming by Example