- Building Machine Learning Systems with Python
- Willi Richert Luis Pedro Coelho
- 148字
- 2021-08-13 16:35:47
Tweaking the parameters
So what about all the other parameters? Can we tweak them all to get better results?
Sure. We could, of course, tweak the number of clusters or play with the vectorizer's max_features
parameter (you should try that!). Also, we could play with different cluster center initializations. There are also more exciting alternatives to KMeans itself. There are, for example, clustering approaches that also let you use different similarity measurements such as Cosine similarity, Pearson, or Jaccard. An exciting field for you to play.
But before you go there, you will have to define what you actually mean by "better". Scikit has a complete package dedicated only to this definition. The package is called sklearn.metrics
and also contains a full range of different metrics to measure clustering quality. Maybe that should be the first place to go now, right into the sources of the metrics package.
- 工程軟件開發技術基礎
- Kubernetes實戰
- Mastering SVG
- Web交互界面設計與制作(微課版)
- Python零基礎快樂學習之旅(K12實戰訓練)
- Python 3破冰人工智能:從入門到實戰
- Unity Shader入門精要
- The Complete Coding Interview Guide in Java
- HoloLens與混合現實開發
- Node學習指南(第2版)
- Python預測之美:數據分析與算法實戰(雙色)
- 精通Spring:Java Web開發與Spring Boot高級功能
- SQL Server實例教程(2008版)
- 信息學奧林匹克競賽初賽精講精練
- HTML5+CSS+JavaScript深入學習實錄