Institutional Repository [SANDBOX]
Technical University of Crete

Performance of multivariate clustering methods in oil families' identification

Karavoulia Christina


URI: http://purl.tuc.gr/dl/dias/05583BB2-C7C4-4BD3-AC9F-F84509207436
Identifier: https://doi.org/10.26233/heallink.tuc.68453
Language: en
Extent: 4.54 megabytes
Title: Performance of multivariate clustering methods in oil families' identification
Creator: Karavoulia Christina (Καραβουλια Χριστινα)
Contributor [Committee Member]: Gaganis Vasileios (Γαγανης Βασιλειος)
Contributor [Thesis Supervisor]: Pasadakis Nikos (Πασαδακης Νικος)
Contributor [Committee Member]: Christopoulos Dionysios (Χριστοπουλος Διονυσιος)
Publisher: Technical University of Crete (Πολυτεχνείο Κρήτης)
Academic Unit: Technical University of Crete::School of Mineral Resources Engineering
Content Summary: As science progresses, the need to analyze multivariate data sets keeps growing. Many disciplines, scientific or otherwise, require large amounts of data to be examined in a short period of time in order to obtain useful information. Over the last few decades, multivariate statistical analysis methods have been developed to meet this need. This dissertation applies multivariate data analysis methods to a given data set derived from oil family affiliations originating from the Williston Basin of North America. In particular, Hierarchical Clustering, k-means and Principal Component Analysis were applied to four independent models in an attempt to extract information about the oil-oil correlations among the samples under study. The models used to explore the compositional information were the Saturated Fraction Compositional Model, the Saturated Fraction Ratios Model, the Gasoline Range Compositional Model and the Biomarkers Compositional Model. These standard statistical methods were found to be quite insufficient for classifying the sample set into distinct familial affiliations. For this reason, the nature of the data set itself had to be examined. Compositional data form a category of their own: they have specific numerical properties that carry significant consequences when they are analyzed with standard multivariate techniques. The analysis of this type of data represents a whole new chapter in statistics, and the need for further examination of the matter is constantly growing.
Type of Item: Master Thesis (Μεταπτυχιακή Διατριβή)
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Date of Item: 2017-06-26
Date of Publication: 2017
Subject: Oil families' identification
Subject: Multivariate clustering
Bibliographic Citation: Christina Karavoulia, "Performance of multivariate clustering methods in oil families' identification", Master Thesis, School of Mineral Resources Engineering, Technical University of Crete, Chania, Greece, 2017
