General Ensemble Method

For a complex question, single classifier or predictor may give way to a multiple of them, an aggregated answer is always better, think about your life experience. And today, popular ensemble methods including bagging, boosting and stacking. Random Forest is also prevailing. Voting Classifier We start with the simple intuition, building a voting system over different classifiers, they could be Logistic Regression, SVM Classifier and Decision Tree… Then we count each classifier’s prediction and to get a overall prediction, it can be the majority vote, and this mechanism is call hard voting, or we can add a weight to each classifier, and obviously this is soft voting. »

Frequent Pattern

Suppose we got the following shopping record: transaction item T1 baguette, croissant T2 baguette, croissant, jam T3 madeleine, croissant, baguette, jam The first term is support, a measure of absolute frequency. | item | count | support | | :—-: | :-: | :-: | | baguette | 3 | 0.333 | | croissant | 3 | 0. »

String

Definition and Glossary Strings are everywhere. Text information always requires string process. A String is a sequence of characters. String Sort LSD key-indexed counting, from right to left MSD recursive method, left to right Three-way string quicksort adapt quicksort to MSD string sorting by using 3-way partitioning on the leading character of the keys, moving to the next character on only the middle subarray Tries A search tree, composed of nodes that contain links that are either null or references to other nodes. »

Graph

Definition and Glossary A graph is a set of vertices and a collection of edges that each connect a pair of vertices. A path in a graph is a sequence of vertices connected by edges. A graph is connected if there is a path from every vertex to every other vertex in the graph. A cycle is a path with at least one edge whose first and last vertices are the same. »

Back Propagation

反向传播算法是神经网络中最重要的部分之一。 对于上图中的这个网络,有三个输入层input layer,四个隐藏层hidden layer,一个输出 »