Clustering methods are essential to partitioning biological samples being useful to minimize the information complexity in large datasets. Tools in this context usually generates data with greed algorithms that solves some Data Mining difficulties which can degrade biological relevant information du[r]
5. Data Mining in a Network Setting 5.1. Community and social networks Today’s world is interconnected through many types of links. These links include Web pages, blogs, and emails. Many respondents consider community mining and the mining of social networks as i[r]
[Sullivan at al, 1998]. These techniques are now applied to discover hidden trends and patterns in financial databases, e.g., in stock market data for market prediction. The question in discussions is how to separate real trends and patterns from mirages . Otherwise,[r]
m ộ t khung th ứ c chung c ủ a s ự rút trích đặ c tr ư ng. ABSTRACT Time series data occur in many real life applications, ranging from science and engineering to business. In many of these applications, searching through large time series database based on query sequence is often desirable.[r]
Unsupervised clustering represents one of the most widely applied methods in analysis of highthroughput ‘omics data. A variety of unsupervised model-based or parametric clustering methods and nonparametric clustering methods have been proposed for RNA-seq count data, most of which perform well for l[r]
To ensure cancer patients are stratified towards treatments that are optimally beneficial, it is a priority to define robust molecular subtypes using clustering methods applied to high-dimensional biological data. If each of these methods produces different numbers of clusters for the same data, it[r]
Phát hiện tri thức và khai phá dữ liệu liên quan đến nhiều ngành, nhiều lĩnh vực: thống kê, trí tuệ nhân tạo, cơ sở dữ liệu, thuật toán, tính toán song song và tốc độ cao, thu thập tri thức cho các hệ chuyên gia, quan sát dữ liệu... Đặc biệt phát hiện tri thức và khai phá dữ liệu rất gần gũi với[r]
Department of Psychiatry, National Taiwan University Hospital, Taiwan, R.O.C. {lcyu, clchan}@saturn.yzu.edu.tw, chwu@csie.ncku.edu.tw, linchri@gmail.com Abstract Negative life events, such as death of a family member, argument with a spouse and loss of a job, play an important role in[r]
This study explores social media data in order to understand students’ learning experiences in Vietnamese by integrating both qualitative analysis and data mining techniques. By the qual[r]
Data analysis methods are usually subdivided in two distinct classes: There are methods for prediction and there are methods for exploration. In practice, however, there often is a need to learn from the data in both ways.
... sentences Incremental learning of data 11 Chapter Introduction association is investigated in this work In the association rule mining field, techniques for maintaining discovered association rules in. .. vectors Association analysis in data mining is to find interesting relationships hidden in[r]
generating aggregated results on the same cluster. These aggregated results are then loaded into a Relational DBMS system. Since data is not always moved during the organization phase, the analysis may also be done in a distributed environment, where some data will stay wh[r]
High utility itemsets refer to the sets of items with high utility like profit in a database, and efficient mining of high utility itemsets plays a crucial role in many reallife applications and is an important research issue in data mining area
3.2 Web content mining Web content mining is the application of data mining techniques to the content of web pages. It often viewed as a subset of text mining, however this is not completely accurate as web pages often contain multimedia files that also contrib[r]
3.2 Web content mining Web content mining is the application of data mining techniques to the content of web pages. It often viewed as a subset of text mining, however this is not completely accurate as web pages often contain multimedia files that also contrib[r]
The distributed fuzzy clustering algorithm to handle large data sets using picture fuzzy sets called DPFCM has improved overall clustering quality in comparison with the algorithm of Che[r]
8.8 Classification Problem Extensions In this section we survey a few extensions to the classical classification problem. In classic supervised learning problems, classes are mutually exclusive by defi- nition. In multi-label classification problems each training instance is given[r]
Depending on the data-mining task the quality assessment approaches aim at estimating different aspects of quality. Thus, in the case of classification the quality refers to i) the ability of the designed classification model to correctly classify new data samples, ii) the[r]
65.1 Introduction In the data mining community, there are three basic types of mining: data mining, web min- ing, and text mining (Zhang and Segall, 2008). In addition, there is a special category called supercomputing data mining[r]