German | English

IPK Gatersleben > Research > Dept. Molecular Genetics > Data Inspection
 

 

Data Inspection
Head: Dr Swetlana Friedel (née Nikolajewa)
Tel: +49 (0)39482 5182
Fax: +49 (0)39482 5137
Email: friedel@ipk-gatersleben.de

Jens Keilwagen, Swetlana Friedel, Michael Seifert 
Research Interest
The junior research group Data Inspection develops and studies novel computer algorithms for the analysis of biological high-throughput data.
The central goal of biological experiments is the collection of new and meaningful data. Measurements do often reflect a condensate of laborious and costly work concerning experimental design, setup, and execution. Since data analysis is the key to gain information and further knowledge about the subject of interest, faithful data inspection methods are required to extract substantial facts from the measurements. 

In addition to the analysis of existing data sets, data inspection aims at creating auxiliary data models that integrate prior knowledge, such as contrast information, class labels, or even only loosely associated observations. Models of this kind include Markov models, such as hidden Markov models (HMM) or Markov random fields (MRF), and extensions of prototype models, such as self-organizing maps (SOM), neural gas (NG), and learning vector quantization (LVQ).

These models are data-driven, i.e. the data space induces a specialized model space for facilitating and focusing the analysis. Problem-specific metric adaptation is one particular and powerful case of this concept, allowing feature rating and feature selection for biomarker detection, as well as improved clustering and classification. Other modeling targets are motif detection in directed data, trustworthy data visualization, and alternative data views that help to overcome limitations of standard statistical methods.

In order to meet the interests of our cooperation partners, the research emphasis is put on the processing of genomic sequences, macroarray and microarray data, and large gelplot collections. Being data-driven, though, the models allow a very broad range of biological applications.
The research group is funded by the Ministery of Culture of Saxony-Anhalt, grant XP 3624HP/0606T.
[ ^ ]
Recent References
201220112010200920082007200620052004
[ ^ ] 2012
[ ^ ] 2011
2011 J. THIEL, H. ROLLETSCHEK, S. FRIEDEL, J.E. LUNN, T.H. NGUYEN, R. FEIL, H. TSCHIERSCH, M. MÜLLER & L. BORISJUK Seed-specific elevation of non-symbiotic hemoglobin AtHb1: beneficial effects and underlying molecular networks in Arabidopsis thaliana. BMC Plant Biology 2011, 11:48
2011 J. KEILWAGEN, J. GRAU, IA. PAPONOV, S. POSCH, M. STRICKERT & I. GROSSE De-Novo Discovery of Differentially Abundant Transcription Factor Binding Sites Including Their Positional Preference. PLoS Comput Biol 7(2): e1001070. doi:10.1371/journal.pcbi.1001070
2011 M. SEIFERT, M. STRICKERT, A. SCHLIEP & I. GROSSE Exploiting prior knowledge and gene distances in the analysis of tumor expression profiles with extended Hidden Markov Models, Bioinformatics, doi: 10.1093/bioinformatics/btr199.
2011 A. BANAEI MOGHADDAM, F. ROUDIER, M. SEIFERT, C. BERAND, M.-L. MARTIN MAGNIETTE, R. KARIMI, A. HOUBEN, V. COLOT, M. F. METTE  Additive inheritance of histone modifications in Arabidopsis thaliana intraspecific hybrids , The Plant Journal, 2011, in press.
2011 HOTZE, M., S. FRIEDEL
dCAIoptimizer: A new tool for advanced codon optimization                   
https://www.omnifung.hki-jena.de/Rpad/dCAIoptimizer.Rpad.
2011 KEILWAGEN, J., J. GRAU, A. GOHR, B. HALDEMANN, M. SEIFERT, M.MOHR, S. POSCH & I.GROSSE
A Java framework for statistical analysis and classification of biological sequences.
http://www.jstacs.de.
2011 KEILWAGEN,J., J. GRAU, I.A. PAPONOV, S. POSCH, M. STRICKERT & I. GROSSE
Dispom: De-Novo Discovery of Differentially Abundant Transcription Factor Binding Sites Including Their Positional Preference. http://www.jstacs.de/index.php/Dispom.
2011 SEIFERT,M., M. STRICKERT, A. SCHLIEP & GROSSE, I.
Exploiting prior knowledge and gene distances in the analysis of tumor expression profiles with extended Hidden Markov Models. http://www.jstacs.de/index.php/DSHMM.
[ ^ ] 2010
2010 C. KALETA, A. GOEHLER, S. SCHUSTER, K. JAHREIS, R. GUTHKE & S. NIKOLAJEWA Integrative Inference of Gene-Regulatory Networks in Escherichia coli Using Information Theoretic Concepts and Sequence Analysis. BMC Systems Biology  4 pp. 116.
2010 R. SINHA, T. LENSER, N. JAHN, U. GAUSMANN, S. FRIEDEL, K. SZAFRANSKI, K. HUSE, P. ROSENSTIEL, J. HAMPE, S. SCHUSTER, M. HILLER, R. BACKOFEN & M. PLATZER TassDB2 - A comprehensive database of subtle alternative splicing events.
BMC Bioinformatics 11 pp. 216.
2010 J. KEILWAGEN, J. GRAU, S. POSCH & I. GROSSE Apples and oranges: avoiding different priors in Bayesian DNA sequence analysis.
BMC Bioinformatics 11(1) pp. 149
2010 J. KEILWAGEN, J. GRAU, S. POSCH, M. STRICKERT & I. GROSSE Unifying generative and discriminative learning principles.                        
BMC Bioinformatics 11(1) pp. 98. PMID: 20175896.
2010 S. POSCH, J. GRAU, A. GOHR, J. KEILWAGEN & I. GROSSE Probabilistic Approaches to Transcription Factor Binding Site Prediction. In: Computational Biology of Transcription Factor Binding.
Series: Methods in Molecular Biology 674, Ladunga, Istvan (Ed.) 415 p. ISBN: 978-1-60761-853-9.
2010 N. SREENIVASULU, R. SUNKAR, U. WOBUS & M. STRICKERT Array Platforms and Bioinformatics Tools for the Analysis of Plant Transcriptome in Response to Abiotic Stress.
In Plant Stress Tolerance - Methods in Molecular Biology. Humana Press 639: 71-93.
2010 WEICHERT, N., I. SAALBACH, H. WEICHERT, S. KOHL, A. ERBAN, J. KOPKA, B. HAUSE, A VARSHNEY, N. SREENIVASULU, M. STRICKERT, J. KUMLEHN, W. WESCHKE, H. WEBER
Increasing sucrose uptake capacity of wheat grains stimulates starage protein synthesis.
Plant Physiol. 152(2): 698-710.
2010 KEILWAGEN, J., J. GRAU, A. GOHR, B. HALDEMANN, M. SEIFERT, M. MOHR, S. POSCH, I. GROSSE
A Java framework for statistical analysis and classification of biological sequences.
http://www.jstacs.de
2010 SINHA, R., T. LENSER, N. JAHN, U. GAUSMANN, S. FRIEDEL, K. SZAFRANSKI, K. HUSE, P. ROSENSTIEL, J. HAMPE, S. SCHUSTER, M. HILLER, R. BACKOFEN, M. PLATZER
TassDB2 - A comprehensive database of subtle alternative splicing events. BMC Bioinformatics 211, 216. PMID: 20429909.
2010 SINHA, R., T. LENSER, N. JAHN, U. GAUSMANN, S. FRIEDEL, K. SZAFRANSKI, K. HUSE, P. ROSENSTIEL, J. HAMPE, S. SCHUSTER, M. HILLER, R. BACKOFEN, M. PLATZER
Tandem Splice Site Database, TassDB2.
http://gen100.imb-jena.de/TassDB2/.
[ ^ ] 2009
2009 M. SEIFERT, J. KEILWAGEN, M. STRICKERT & I. GROSSE
Utilizing gene pair orientations for HMM-based analysis of promoter array ChIP-chip data.
Bioinformatics 25(16): 2118-2125. doi:0.1093/bioinformatics/btp276.
 
2009 M. SEIFERT, A. BANAEI, J. KEILWAGEN, M.F. METTE, A. HOUBEN, F. ROUDIER, V. COLOT, I. GROSSE & M. STRICKERT
Array-based Genome Comparison of Arabidopsis Ecotypes Using Hidden Markov Models.
Proc. 2nd International Conference on Bio-inspired Systems and Signal Processing (BIOSIGNALS). pp. 3-11.
 
2009 J. KEILWAGEN, J. BAUMBACH, T. KOHL & I. GROSSE MotifAdjuster: a tool for computational reassessment of transcription factor binding site annotations.
Genome Biology 10(5) doi:10.1186/gb-2009-10-5-r46.
2009 M. STRICKERT, J. KEILWAGEN, F. -M. SCHLEIF, T. VILLMANN & M. BIEHL Matrix Metric Adaptation Linear Discriminant Analysis of Biomedical Data. Bio-Inspired Systems: Computational and Ambient Intelligence.
Springer Lecture Notes in Computer Science. LNCS 5517. pp. 933-940.
2009 M. STRICKERT, F.-M. SCHLEIF, T. VILLMANN & U. SEIFFERT Unleashing Pearson Correlation for Faithful Analysis of Biomedical Data. In: Similarity-Based Clustering - Recent Developments and Biomedical Applications.
Springer Lecture Notes in Computer Science. LNCS 5400. pp. 70-91.
2009 M. STRICKERT, A. SOTO, J. KEILWAGEN & G.E. VAZQUEZ Towards matrix-based selection of feature pairs for efficient ADMET prediction.
Proceedings of the 10th Argentine Symposium on Artificial Intelligence ASAI 2009. pp. 83-94.
[ ^ ] 2008
2008 J. THIEL, D. WEIER, N. SREENIVASULU, M. STRICKERT, N. WEICHERT, M. MELZER, T. CZAUDERNA, U. WOBUS, H. WEBER & W. WESCHKE
Different hormonal regulation of cellular differentiation and function in nucellar projection and endosperm transfer cells – a microdissection-based transcriptome study of young barley grains.
Plant Physiology 148: 1436-1452.
2008 N. SREENIVASULU, B. USADEL, A. WINTER, V. RADCHUK, U. SCHOLZ, N. STEIN, W. WESCHKE, M. STRICKERT, T.J. CLOSE, M. STITT, A. GRANER & U. WOBUS Barley grain maturation and germination: Metabolic pathway and regulatory network commonalities and differences highlighted by new MapMan/PageMan profiling tools.
Plant Physiol. 146: 1738-1758.
2008 M. STRICKERT, F.-M. SCHLEIF, U. SEIFFERT & T. VILLMANN
Derivatives of Pearson Correlation for Gradient-based Analysis of Biomedical Data.
Revista Iberoamericana de Inteligencia Artificial 12 (37): 37-44.
 
2008 M. STRICKERT, K. WITZEL, J. KEILWAGEN, H.-P. MOCK, P. SCHNEIDER & M. BIEHL
Adaptive matrix metrics for attribute dependence analysis in differential high-throughput data.
Proc. 5th International Workshop on Computational Systems Biology (WCSB), TICSP series 41: 181-184.
 
2008 M. STRICKERT, N. SREENIVASULU, T. VILLMANN & B. HAMMER
Robust centroid-based clustering using derivatives of Pearson correlation.
Proc. International Conference on Biomedical Engineering Systems and Technologies. INSTICC Publications: 197-203.
 
2008 M. STRICKERT, P. SCHNEIDER, J. KEILWAGEN, T. VILLMANN, M. BIEHL & B. HAMMER
Discriminatory data mapping by matrix-based supervised learning metrics.
Lecture Notes in Computer Science, LNCS 5065: 78-89.
 
2008 M. STRICKERT, F.-M. SCHLEIF & T. VILLMANN
Metric adaptation for supervised attribute rating.
European Symposium on Artificial Neural Networks (ESANN), D-facto Publications: 31-36.
 
[ ^ ] 2007
2007 STRICKERT, M., N. SREENIVASULU, B. USADEL & U. SEIFFERT
Correlation-maximizing surrogate gene space for visual mining of gene expression patterns in developing barley endosperm tissue.
BMC Bioinformatics 22;8(1): 165.
2007 STRICKERT, M. & U. SEIFFERT Online proceedings of the Dagstuhl Seminar 'Similarity-based Clustering and its Application to Medicine and Biology'.
2007 VILLMANN, T., M. STRICKERT, C. BRÜß, F.-M. SCHLEIF & U. SEIFFERT Visualization of Fuzzy Information in Fuzzy-Classification for Image Segmentation using MDS. Proc. of the European Symp. on Artificial Neural Networks (ESANN). D-Side publishers Evere/Belgium, pp. 103-107.
2007 HAMMER, B., A. HASENFUß, F.-M. SCHLEIF, T. VILLMANN & M. STRICKERT
Intuitive Clustering of Biological Data.
Proc. of the International Joint Conference on Artificial Neural Networks (IJCNN), ISSN 1-4244-1380-X.
2007 HAMMER, B., A. HASENFUß, F. ROSSI & M. STRICKERT
Topographic Processing of Relational Data.
Online proceedings of the International Workshop on Self-Organizing Maps (WSOM), ISBN 978-3-00-022473-7.
2007 STRICKERT, M., F.-M. SCHLEIF & U. SEIFFERT
Gradients of Pearson Correlation for the Analysis of Biomedical Data.
Proc. of the Argentine Symp. on Artificial Intelligence (ASAI), pp. 139-150, ISSN 1850-2784.
2007 STRICKERT, M., K. WITZEL, H.-P. MOCK, F.-M. SCHLEIF & T. VILLMANN Supervised Attribute Relevance Determination for Protein Identification in Stress Experiments. Proceedings of Machine Learning in Systems Biology (MLSB).
2007 HAMMER, B., A. HASENFUß, F. ROSSI & M. STRICKERT
Topographic Processing of Relational Data. The 6th International Workshop on Self-Organizing Maps (WSOM), ISBN 978-3-00-022473-7
2007 VILLMANN, T., F.-M. SCHLEIF, E. MERENYI, M. STRICKERT & B. HAMMER
Class imaging of hyperspectral satellite remote sensing data using FLSOM. The 6th International Workshop on Self-Organizing Maps (WSOM), ISBN 978-3-00-022473-7
2007 WESCHKE, W., H.-P MOCK, C. PIETSCH, V. RADCHUK, M.S. RÖDER, F. SCHREIBER, U. SEIFFERT, N. SREENIVASULU, M. STRICKERT, K. WITZEL & U. WOBUS "Genetical Genomics" der Gerstenkornentwicklung - von der Genexpression zu landwirtschaftlich bedeutsamen Merkmalen. GenomXPress 1: 12-16.
[ ^ ] 2006
[ ^ ] 2005
[ ^ ] 2004
[ ^ ]
Staff
scientific staff
Friedel, Dr. Swetlana +49 (0)39482 5182
Keilwagen, Dr. Jens +49 (0)39482 5188
Seifert, Dr. Michael +49 (0)39482 5336
staff or visitors
[ ^ ]
Interesting Links
http://dig.ipk-gatersleben.de/
Data Inspection Group Website 
[ ^ ]