Wayne edited The_Approach_of_MarkovBin_MarkovBin__.md  about 8 years ago

Commit id: 15583892e0d3c07225ac8899391495128ed59b46

deletions | additions      

       

MarkovBin splits the whole GC-content region to subregions with equal length first and then optimizes by EM algorithm. EM algorithm is well done and we have no way to compete. But the initial separation is really dangerous. To our dataset, the range of GC-content is from 0.15 to 0.75. There are 5 species so the 5 initial subregions given by MarkovBin are [0.15, 0.27], [0.27, 0.39], [0.39, 0.51], [0.51, 0.63], [0.63, 0.75]. From observations in the previous section, we can tell there is something wrong. MarkovBin separates species 2 into two different regions. Unfortunately, the signals from these two regions are similar and strong. We can imagine that species 2 will fulfill 2 subgroups which is a serious mistake.  The result in Table\ref{tab:1} \ref{tab:1}  demonstrates this judgment.