Date of Award

5-2019

Document Type

Campus Access Dissertation

Degree Name

Doctor of Philosophy (PhD)

Department

Computer Science

First Advisor

Dan A. Simovici

Second Advisor

Marc Pomplun

Third Advisor

Wei Ding

Abstract

The demand for new techniques brought by data mining has been skyrocketting in recent years due to the massive increase in data available in many areas such as science, engineering, finance, biological research, medicine etc. We introduce several combinatorial algorithms that improve mining of binary data sets. Our contribution consists of a genetic algorithm to mine frequent itemsets and large bite sets, an algorithm that identifies determining sets for index functions, a compression algorithm that utilizes Gini-based distances between attributes in datasets, and an algorithm to find Vapnik-Chervonenkis dimension of binary tables. Also, we propose a new approach to genetic algorithms and introduce a novel type-based genetic algorithm, which we apply to two well-known problems: N-queen problem and finding the global minimum to the Rosenbrock function.

Comments

Free and open access to this Campus Access Dissertation is made available to the UMass Boston community by ScholarWorks at UMass Boston. Those not on campus and those without a UMass Boston campus username and password may gain access to this dissertation through resources like Proquest Dissertations & Theses Global or through Interlibrary Loan. If you have a UMass Boston campus username and password and would like to download this work from off-campus, click on the "Off-Campus UMass Boston Users" link above.

Share

COinS