The following code records how I load csv files to R and run hierarchical clustering algorithm on the data. I have to say that using R to plot the data is extremely EASY to do!
> df <- read.csv("C:/Users/Creek/Desktop/data.txt", row.names=1,head=FALSE, sep=",") > d <- dist(as.matrix(df)) > hc <- hclust(d) > plot(hc) > q()
row.names=1 specifies the first column is name/label, not data of variables.
head=FALSE specifies the is on head data.
Here is a cluster dendrogram:
You can also cut the cluster at some height. You can cut by specifying the number of clusters by using:
rect.hclust(hc, k=5, border="red")
or you can cut by setting the height:
rect.hclust(hc, h=0.5, border="red")
<pre><code> String foo = "bar"; </code></pre>