Thanks more for sharing the video. If possible can you show an other video on Prep-design and lattice design (i.e. simple and triple lattice) data analysis ? Thank you...
You are welcome Zeru, you can check the recent video on Agricultural statistical tools which is uploaded recently in my data analysis playlist you may find what you're looking for in STAR app manual.
After calculating d sq distance according to privious video. Then according to this video number of cluster is different how we can solve this. Please respond
The topic was very simple clearly & well explained. Your video is now among my go to resources in data science. Just a small constructive criticism : You have already added your R code however the dataset you used (pea.xlsx) hasn't been included. Thus, your script won't work without it & it may have no use for people who wishes to try practice themselves. So, that would be much better if you include the excel file as well, imho. But overall fantastic tutorial & very helpful for rookies 👍
I wanted to perform hierarchical clustering for my data which has 4k observations, I have applied 'NbClust' with smaller number of clusters i.e 2 - 9, and it throws this error - " Error in plot.new() : figure margins too large "
When I used NbClust I am getting this message 'Error in NbClust(data = Table.s, diss = NULL, distance = "euclidean", : The TSS matrix is indefinite. There must be too many missing values. The index cannot be calculated." The silhouette method is given number of clusters as 7
Which method will be suitable for cluster analysis for 300 treatments/genotypes and how can i represent it using dendrogram or any clustering technique for data visualization.
Thank you for the video sir. If scaling is only for quantitative trait what could we do for qualitative traits...how to do cluster analysis for qualitative traits?
Most often we won't consider qualitative traits, if you are considering them the clustering will not be good, if you want to that you can do it in SPSS using two step clustering option.
hello sir, my design was Augmented 2 with 7 blocks ..210 RILs of sorghum , with 6 checks replicated in each block . In case of this should igo for hierarchical clustering analysis using k means and include adjusted mean of 210 lines for analysis. . am I right s? I have to do interpretation on the bases of fodder yield. ..so is their any specification for doing trait wise clustering.. I have taken quantative traits along with shoot fly associated dead heart percent etc traits .. as my RILs are the population of shoot fly susceptible x resistance cross .... so how should I proceed for the analysis on diversity for forage sorghum based on my characters ...total 18 characters were taken..... Should I limit my traits according to my objective of interpretation ..or should I subject all the traits .... .is for non replicated data non hierarchical clustering Is different from hierarchical ?
When I run NbClust(data=rice.s, diss= NULL, distance= "euclidean", min.nc = 2, max.nc = 15, method = "complete", index= "all", alphaBeale = 0.1), then I got the following error: Error: division by zero!Error in solve.default(W) : Lapack routine dgesv: system is exactly singular: U[6,6] = 0 How can I solve this? Thank you.
I have loaded corrected means from augmented RCBD.. But one numerical trait is taken as character.. after changing to numeric it is showing values as 'NA' for all genotypes... This unable to proceed with k means .. it is showing error... How to go about..? Kindly help..
I hope there might be a problem while selecting number of clusters object, in case of that you can manual calculate the average by adding one more extra column alongside of treatments indicating which cluster they belong, later sort them and get the average using MSExcel.
Dear Sir, I am doing cluster analysis of 40,000 points. I used the efficient hierarchical clustering algorithm and successfully did the clustering. Now I want to choose the optimal no of clusters. Which one would you suggest for dealing with large dataset? Because I need to test from k=1 , … , k=40000, which is huge
Hi. I am new to this field. I'am trying to conduct as you illustrated, but I got a pause at clustering with NbClust: Error in NbClust(data = newdata, diss = NULL, distance = "euclidean", : The TSS matrix is indefinite. There must be too many missing values. The index cannot be calculated. Could you help me?
> NbClust(data = pea.s, diss =NULL, distance = "euclidean", min.nc = 2, max.nc = 15, + method = "kmeans" , index = "all", alphaBeale = 0.1) Error in NbClust(data = pea.s, diss = NULL, distance = "euclidean", min.nc = 2, : The TSS matrix is indefinite. There must be too many missing values. The index cannot be calculated. can you tell me how to overcome this problem???
Error in t(jeu) %*% jeu : requires numeric/complex matrix/vector arguments Getting this error while computing the optimum number of clusters using NbClust ...Please help
@@Guruprasad_A: no it does not. I am using the adjusted mean data of the augmented RCBD design. I did that analysis using by seeing your video only "augmented RCBD in R".
@@Guruprasad_A : Got it fixed, thanks alot for the wonderful video,please share how to extract the analysis report in word format, as you did in previous video.
An error is comming while loading package factoextra : package or namespace load failed for 'ggplot2' in loadnamespace (i, c(lib.loc, .libPaths()), versioncheck = vI[[i]] : namespace 'ellipsis' 0.3 .1 is already loaded , but >= 0.3.2 is required. Please provide solution for this
Error in solve.default(W) : system is computationally singular: reciprocal condition number = 6.36052e-19 In addition: Warning message: In log(det(P)/det(W)) : NaNs produced Sir please help with this error.
When I am running the NbClust it gives me error Error in solve.default(W) : system is computationally singular: reciprocal condition number = 8.06347e-18
hello sir , I'm getting error after running this command: NbClust(data = clust.s, diss = NULL, distance = "euclidean", min.nc = 2, max.nc =15,method = "kmeans",index = "all",alphaBeale = 0.1) error is: Error in NbClust(data = clust.s, diss = NULL, distance = "euclidean", : The TSS matrix is indefinite. There must be too many missing values. The index cannot be calculated. please gives your valuable suggestions regarding this