The project develops methods for the exploratory data analysis of large and high-dimensional data sets. One of the themes has been finding frequent patterns in large collections of data. The pattern classes include ordered and unordered patterns. Current areas of interest include condensed representations and the combination of combinatorial and probabilistic techniques for approximating distributions. For sequential data, the interests are in algorithms for sequence segmentation under various restrictions and in the discovery of order from unordered data sets. Issues in subspace clustering and spectral methods have also been studied.
In 2005 there were several interesting developments. The methods for seriation problems in paleontological and other applications advanced considerably, and the resulting publications were accepted to important forums. The novel problem setting of mining chains of relations shows great promise, as does the work on condensed representations and on spatial clustering. Special emphasis was given to work on finding partial orders from data.
Data Mining [2], Prof. Heikki Mannila, Prof. Hannu Toivonen
See www.cs.helsinki.fi/research/fdk/datamining [3] for further information and publications.
Links:
[1] http://www.hiit.fi/ada/datamining
[2] http://www.hiit.fi/node/18
[3] http://www.cs.helsinki.fi/research/fdk/datamining
Last update: 10 Dec, 2007. Page content by: Webmaster.