A new parallel data geometry analysis algorithm to select training data for support vector machine