Date of Award
5-2019
Document Type
Campus Access Dissertation
Degree Name
Doctor of Philosophy (PhD)
Department
Computer Science
First Advisor
Dan A. Simovici
Second Advisor
Marc Pomplun
Third Advisor
Wei Ding
Abstract
The demand for new techniques brought by data mining has been skyrocketting in recent years due to the massive increase in data available in many areas such as science, engineering, finance, biological research, medicine etc. We introduce several combinatorial algorithms that improve mining of binary data sets. Our contribution consists of a genetic algorithm to mine frequent itemsets and large bite sets, an algorithm that identifies determining sets for index functions, a compression algorithm that utilizes Gini-based distances between attributes in datasets, and an algorithm to find Vapnik-Chervonenkis dimension of binary tables. Also, we propose a new approach to genetic algorithms and introduce a novel type-based genetic algorithm, which we apply to two well-known problems: N-queen problem and finding the global minimum to the Rosenbrock function.
Recommended Citation
Sizov, Roman A., "Combinatorial Algorithms in Data Mining" (2019). Graduate Doctoral Dissertations. 467.
https://scholarworks.umb.edu/doctoral_dissertations/467
Comments
Free and open access to this Campus Access Dissertation is made available to the UMass Boston community by ScholarWorks at UMass Boston. Those not on campus and those without a UMass Boston campus username and password may gain access to this dissertation through resources like Proquest Dissertations & Theses Global or through Interlibrary Loan. If you have a UMass Boston campus username and password and would like to download this work from off-campus, click on the "Off-Campus UMass Boston Users" link above.