German | English

IPK Gatersleben > Research > Dept. Molecular Genetics > Data Inspection
 

 

Data Inspection
Head: Dr Swetlana Friedel (née Nikolajewa)
Tel: +49 (0)39482 5182
Fax: +49 (0)39482 5137
Email: friedel@ipk-gatersleben.de

Jens Keilwagen, Swetlana Friedel, Michael Seifert 
Research Interest
The junior research group Data Inspection develops and studies novel computer algorithms for the analysis of biological high-throughput data.
The central goal of biological experiments is the collection of new and meaningful data. Measurements do often reflect a condensate of laborious and costly work concerning experimental design, setup, and execution. Since data analysis is the key to gain information and further knowledge about the subject of interest, faithful data inspection methods are required to extract substantial facts from the measurements. 

In addition to the analysis of existing data sets, data inspection aims at creating auxiliary data models that integrate prior knowledge, such as contrast information, class labels, or even only loosely associated observations. Models of this kind include Markov models, such as hidden Markov models (HMM) or Markov random fields (MRF), and extensions of prototype models, such as self-organizing maps (SOM), neural gas (NG), and learning vector quantization (LVQ).

These models are data-driven, i.e. the data space induces a specialized model space for facilitating and focusing the analysis. Problem-specific metric adaptation is one particular and powerful case of this concept, allowing feature rating and feature selection for biomarker detection, as well as improved clustering and classification. Other modeling targets are motif detection in directed data, trustworthy data visualization, and alternative data views that help to overcome limitations of standard statistical methods.

In order to meet the interests of our cooperation partners, the research emphasis is put on the processing of genomic sequences, macroarray and microarray data, and large gelplot collections. Being data-driven, though, the models allow a very broad range of biological applications.
The research group is funded by the Ministery of Culture of Saxony-Anhalt, grant XP 3624HP/0606T.
[ ^ ]
Recent References
2010200920082007200620052004
[ ^ ] 2010
2010 C. KALETA, A. GOEHLER, S. SCHUSTER, K. JAHREIS, R. GUTHKE & S. FRIEDEL Integrative Inference of Gene-Regulatory Networks in Escherichia coli Using Information Theoretic Concepts and Sequence Analysis. BMC Systems Biology ,vol. 4 pp. 116
R. SINHA, T. LENSER, N. JAHN, U. GAUSMANN, S. FRIEDEL, K. SZAFRANSKI, K. HUSE, P. ROSENSTIEL, J. HAMPE, S. SCHUSTER, M. HILLER, R. BACKOFEN & M. PLATZER TassDB2 - A comprehensive database of subtle alternative splicing events. BMC Bioinformatics 2010, vol. 11 pp. 216
J. KEILWAGEN, J. GRAU, S. POSCH & I. GROSSE Apples and oranges: avoiding different priors in Bayesian DNA sequence analysis. BMC Bioinformatics, vol. 11(1), pp. 149
J. KEILWAGEN, J. GRAU, S. POSCH, M. STRICKERT & I. GROSSE Unifying generative and discriminative learning principles. BMC Bioinformatics, vol. 11(1), pp. 98  Highly accessed.
S. POSCH, J. GRAU, A. GOHR, J. KEILWAGEN & I. GROSSE Probabilistic Approaches to Transcription Factor Binding Site Prediction. In Computational Biology of Transcription Factor Binding. Series: Methods in Molecular Biology, Vol. 674, Ladunga, Istvan (Ed.), 2010, 415 p.  ISBN: 978-1-60761-853-9
N. SREENIVASULU, R. SUNKAR, U. WOBUS & M. STRICKERT Array Platforms and Bioinformatics Tools for the Analysis of Plant Transcriptome in Response to Abiotic Stress. In Plant Stress Tolerance -    Methods in Molecular Biology,    Humana Press,  2010 pp 71-93
[ ^ ] 2009
2009 M. SEIFERT, J. KEILWAGEN, M. STRICKERT & I. GROSSE
Utilizing gene pair orientations for HMM-based analysis of promoter array ChIP-chip data. Bioinformatics, vol. 25(16), pp. 2118-2125 , doi:0.1093/bioinformatics/btp276.
 
2009 M. SEIFERT, A. BANAEI, J. KEILWAGEN, M.F. METTE, A. HOUBEN, F. ROUDIER, V. COLOT, I. GROSSE & M. STRICKERT
Array-based Genome Comparison of Arabidopsis Ecotypes Using Hidden Markov Models. Proc. 2nd International Conference on Bio-inspired Systems and Signal Processing (BIOSIGNALS), pp. 3-11.
 
2009 J. KEILWAGEN, J. BAUMBACH, T. KOHL & I. GROSSE MotifAdjuster: a tool for computational reassessment of transcription factor binding site annotations. Genome Biology 10(5), doi:10.1186/gb-2009-10-5-r46
2009 M. STRICKERT, J. KEILWAGEN, F. -M. SCHLEIF, T. VILLMANN & M. BIEHL Matrix Metric Adaptation Linear Discriminant Analysis of Biomedical Data. Bio-Inspired Systems: Computational and Ambient Intelligence, Springer Lecture Notes in Computer Science, LNCS 5517, pp. 933-940.
2009 M. STRICKERT, F.-M. SCHLEIF, T. VILLMANN & U. SEIFFERT Unleashing Pearson Correlation for Faithful Analysis of Biomedical Data. In: Similarity-Based Clustering - Recent Developments and Biomedical Applications, Springer Lecture Notes in Computer Science, LNCS 5400, pp. 70-91.
2009 M. STRICKERT, A. SOTO, J. KEILWAGEN & G.E. VAZQUEZ Towards matrix-based selection of feature pairs for efficient ADMET prediction. Proceedings of the 10th Argentine Symposium on Artificial Intelligence ASAI 2009, pp. 83-94.
[ ^ ] 2008
2008 J. THIEL, D. WEIER, N. SREENIVASULU, M. STRICKERT, N. WEICHERT, M. MELZER, T. CZAUDERNA, U. WOBUS, H. WEBER & W. WESCHKE
Different hormonal regulation of cellular differentiation and function in nucellar projection and endosperm transfer cells – a microdissection-based transcriptome study of young barley grains. Plant Physiology 148, pp. 1436-1452.
2008 N. SREENIVASULU, B. USADEL, A. WINTER, V. RADCHUK, U. SCHOLZ, N. STEIN, W. WESCHKE, M. STRICKERT, T.J. CLOSE, M. STITT, A. GRANER & U. WOBUS Barley grain maturation and germination: Metabolic pathway and regulatory network commonalities and differences highlighted by new MapMan/PageMan profiling tools. Plant Physiol. 146: 1738-1758.
2008 M. STRICKERT, F.-M. SCHLEIF, U. SEIFFERT & T. VILLMANN
Derivatives of Pearson Correlation for Gradient-based Analysis of Biomedical Data. Revista Iberoamericana de Inteligencia Artificial 12 (37) 37-44.
 
2008 M. STRICKERT, K. WITZEL, J. KEILWAGEN, H.-P. MOCK, P. SCHNEIDER & M. BIEHL
Adaptive matrix metrics for attribute dependence analysis in differential high-throughput data. Proc. 5th International Workshop on Computational Systems Biology (WCSB), TICSP series 41, pp. 181-184.
 
2008 M. STRICKERT, N. SREENIVASULU, T. VILLMANN & B. HAMMER
Robust centroid-based clustering using derivatives of Pearson correlation. Proc. International Conference on Biomedical Engineering Systems and Technologies. INSTICC Publications, pp. 197-203.
 
2008 M. STRICKERT, P. SCHNEIDER, J. KEILWAGEN, T. VILLMANN, M. BIEHL & B. HAMMER
Discriminatory data mapping by matrix-based supervised learning metrics. Lecture Notes in Computer Science, LNCS 5065, pp. 78-89.
 
2008 M. STRICKERT, F.-M. SCHLEIF & T. VILLMANN
Metric adaptation for supervised attribute rating. European Symposium on Artificial Neural Networks (ESANN), D-facto Publications, pp. 31-36.
 
[ ^ ] 2007
2007 STRICKERT, M., N. SREENIVASULU, B. USADEL & U. SEIFFERT
Correlation-maximizing surrogate gene space for visual mining of gene expression patterns in developing barley endosperm tissue.
BMC Bioinformatics 22;8(1): 165.
2007 STRICKERT, M. & U. SEIFFERT Online proceedings of the Dagstuhl Seminar 'Similarity-based Clustering and its Application to Medicine and Biology'.
2007 VILLMANN, T., M. STRICKERT, C. BRÜß, F.-M. SCHLEIF & U. SEIFFERT Visualization of Fuzzy Information in Fuzzy-Classification for Image Segmentation using MDS. Proc. of the European Symp. on Artificial Neural Networks (ESANN). D-Side publishers Evere/Belgium, pp. 103-107.
2007 HAMMER, B., A. HASENFUß, F.-M. SCHLEIF, T. VILLMANN & M. STRICKERT
Intuitive Clustering of Biological Data.
Proc. of the International Joint Conference on Artificial Neural Networks (IJCNN), ISSN 1-4244-1380-X.
2007 HAMMER, B., A. HASENFUß, F. ROSSI & M. STRICKERT
Topographic Processing of Relational Data.
Online proceedings of the International Workshop on Self-Organizing Maps (WSOM), ISBN 978-3-00-022473-7.
2007 STRICKERT, M., F.-M. SCHLEIF & U. SEIFFERT
Gradients of Pearson Correlation for the Analysis of Biomedical Data.
Proc. of the Argentine Symp. on Artificial Intelligence (ASAI), pp. 139-150, ISSN 1850-2784.
2007 STRICKERT, M., K. WITZEL, H.-P. MOCK, F.-M. SCHLEIF & T. VILLMANN Supervised Attribute Relevance Determination for Protein Identification in Stress Experiments. Proceedings of Machine Learning in Systems Biology (MLSB).
2007 HAMMER, B., A. HASENFUß, F. ROSSI & M. STRICKERT
Topographic Processing of Relational Data. The 6th International Workshop on Self-Organizing Maps (WSOM), ISBN 978-3-00-022473-7
2007 VILLMANN, T., F.-M. SCHLEIF, E. MERENYI, M. STRICKERT & B. HAMMER
Class imaging of hyperspectral satellite remote sensing data using FLSOM. The 6th International Workshop on Self-Organizing Maps (WSOM), ISBN 978-3-00-022473-7
2007 WESCHKE, W., H.-P MOCK, C. PIETSCH, V. RADCHUK, M.S. RÖDER, F. SCHREIBER, U. SEIFFERT, N. SREENIVASULU, M. STRICKERT, K. WITZEL & U. WOBUS "Genetical Genomics" der Gerstenkornentwicklung - von der Genexpression zu landwirtschaftlich bedeutsamen Merkmalen. GenomXPress 1: 12-16.
[ ^ ] 2006
[ ^ ] 2005
[ ^ ] 2004
[ ^ ]
Staff
scientific staff
Friedel, Dr. Swetlana +49 (0)39482 5182
Keilwagen, Jens +49 (0)39482 5188
Seifert, Michael +49 (0)39482 5336
staff or visitors
Scharfe, Michael
[ ^ ]
Interesting Links
http://dig.ipk-gatersleben.de/
Data Inspection Group Website 
[ ^ ]