- NMF
W basis
H coefficients
-
Rank選擇
圖
- Sparseness
殘差和殘差平方和自不必說
-
dispersion 離差
- In statistics, dispersion (also called variability, scatter, or spread) is the extent to which a distribution is stretched or squeezed.[1] Common examples of measures of statistical dispersion are the variance, standard deviation, and interquartile range. 也就是和方差识窿、標(biāo)準(zhǔn)差類似的概念
-
silhouette
Silhouette refers to a method of interpretation and validation of consistency within clusters of data. The technique provides a succinct graphical representation of how well each object lies within its cluster.[1]
The silhouette value is a measure of how similar an object is to its own cluster (cohesion) compared to other clusters (separation). The silhouette ranges from ?1 to +1, where a high value indicates that the object is well matched to its own cluster and poorly matched to neighboring clusters. If most objects have a high value, then the clustering configuration is appropriate. If many points have a low or negative value, then the clustering configuration may have too many or too few clusters.
The silhouette can be calculated with any distance metric, such as the Euclidean distance or the Manhattan distance.
-
cophenetic 同表象相關(guān)
- 描述
http://blog.sciencenet.cn/home.php?mod=space&uid=3406804&do=blog&quickforward=1&id=1175517
-
選擇Rank方法匯總
選擇cophenetic correlation coefficient 開始下降的最小Rank值
選擇cophenetic隨rank值變化曲線中最大變動的前一個點
選擇RSS出現(xiàn)一個拐點(inflection point)
- 選擇觀測數(shù)據(jù)殘差減少大于隨機(jī)數(shù)據(jù)殘差減少的最小Rank
更新:沒用一個完全確定的方法可以自動確定數(shù)目斩郎,一般同時根據(jù)聚類的可重復(fù)性和殘差進(jìn)行判斷。