Date of Award

5-31-2016

Document Type

Campus Access Dissertation

Degree Name

Doctor of Philosophy (PhD)

Department

Computer Science

First Advisor

Wei Ding

Second Advisor

Dan Simovici

Third Advisor

Nurit Haspel

Abstract

While instituting data mining methodologies to extract hidden patterns and to learn useful knowledge from real-world Big Data, data scientists have reached certain boundaries due to the limitations of the current computational capabilities in processing and modeling the sheer size of the data with complex heterogeneity. On one hand, the Big Data of certain domains, such as climate data, are temporally and spatially intensive, which require special attention while utilizing the domain knowledge to extract the most suitable features and develop simulation or prediction models embedded with spatio-temporal dimensions. On the other hand, the Big Data in certain domains are temporally and spatially scattered, which present another set of challenges for predictive modeling.

In this dissertation, to overcome the current issues, two methodologies are proposed. The first one is called Dimensional Ensemble Modeling, which identifies spatial signatures, such as points of interest or spatial distributions, within segmented time windows for ensemble modeling. Secondly, Hierarchical Ensemble Modeling, the second methodology proposed, utilizes patterns locally extracted at various granularity levels and constructs a global ensemble pattern in a hierarchical fashion through a specially designed boosting strategy, CCRBoost.

In tight cooperation with domain experts, both of the proposed methodologies have been empirically evaluated in two case studies using two real-world data sets. The proposed methodologies have shown promising results in terms of predictive performance in both cases.

Comments

Free and open access to this Campus Access Dissertation is made available to the UMass Boston community by ScholarWorks at UMass Boston. Those not on campus and those without a UMass Boston campus username and password may gain access to this dissertation through resources like Proquest Dissertations & Theses Global or through Interlibrary Loan. If you have a UMass Boston campus username and password and would like to download this work from off-campus, click on the "Off-Campus UMass Boston Users" link above.

Share

COinS