The junior research group Data Inspection develops and studies novel computer algorithms for the analysis of biological high-throughput data.
The central goal of biological experiments is the collection of new and meaningful data. Measurements do often reflect a condensate of laborious and costly work concerning experimental design, setup, and execution. Since data analysis is the key to gain information and further knowledge about the subject of interest, faithful data inspection methods are required to extract substantial facts from the measurements.
In addition to the analysis of existing data sets, data inspection aims at creating auxiliary data models that integrate prior knowledge, such as contrast information, class labels, or even only loosely associated observations. Models of this kind include Markov models, such as hidden Markov models (HMM) or Markov random fields (MRF), and extensions of prototype models, such as self-organizing maps (SOM), neural gas (NG), and learning vector quantization (LVQ).
These models are data-driven, i.e. the data space induces a specialized model space for facilitating and focusing the analysis. Problem-specific metric adaptation is one particular and powerful case of this concept, allowing feature rating and feature selection for biomarker detection, as well as improved clustering and classification. Other modeling targets are motif detection in directed data, trustworthy data visualization, and alternative data views that help to overcome limitations of standard statistical methods.
In order to meet the interests of our cooperation partners, the research emphasis is put on the processing of genomic sequences, macroarray and microarray data, and large gelplot collections. Being data-driven, though, the models allow a very broad range of biological applications.
The research group is funded by the Ministery of Culture of Saxony-Anhalt, grant XP 3624HP/0606T.
Exploiting prior knowledge and gene distances in the analysis of tumor expression profiles with extended Hidden Markov Models, Bioinformatics, doi: 10.1093/bioinformatics/btr199.
KEILWAGEN, J., J. GRAU, A. GOHR, B. HALDEMANN, M. SEIFERT, M.MOHR, S. POSCH & I.GROSSE
A Java framework for statistical analysis and classification of biological sequences. http://www.jstacs.de.
2011
KEILWAGEN,J., J. GRAU, I.A. PAPONOV, S. POSCH, M. STRICKERT & I. GROSSE
Dispom: De-Novo Discovery of Differentially Abundant Transcription Factor Binding Sites Including Their Positional Preference. http://www.jstacs.de/index.php/Dispom.
2011
SEIFERT,M., M. STRICKERT, A. SCHLIEP & GROSSE, I.
Exploiting prior knowledge and gene distances in the analysis of tumor expression profiles with extended Hidden Markov Models. http://www.jstacs.de/index.php/DSHMM.
C. KALETA, A. GOEHLER, S. SCHUSTER, K. JAHREIS, R. GUTHKE & S. NIKOLAJEWA
Integrative Inference of Gene-Regulatory Networks in Escherichia coli Using Information Theoretic Concepts and Sequence Analysis. BMC Systems Biology 4 pp. 116.
R. SINHA, T. LENSER, N. JAHN, U. GAUSMANN, S. FRIEDEL, K. SZAFRANSKI, K. HUSE, P. ROSENSTIEL, J. HAMPE, S. SCHUSTER, M. HILLER, R. BACKOFEN & M. PLATZER
TassDB2 - A comprehensive database of subtle alternative splicing events. BMC Bioinformatics 11 pp. 216.
N. SREENIVASULU, R. SUNKAR, U. WOBUS & M. STRICKERT
Array Platforms and Bioinformatics Tools for the Analysis of Plant Transcriptome in Response to Abiotic Stress. In Plant Stress Tolerance - Methods in Molecular Biology. Humana Press 639: 71-93.
WEICHERT, N., I. SAALBACH, H. WEICHERT, S. KOHL, A. ERBAN, J. KOPKA, B. HAUSE, A VARSHNEY, N. SREENIVASULU, M. STRICKERT, J. KUMLEHN, W. WESCHKE, H. WEBER
Increasing sucrose uptake capacity of wheat grains stimulates starage protein synthesis. Plant Physiol. 152(2): 698-710.
KEILWAGEN, J., J. GRAU, A. GOHR, B. HALDEMANN, M. SEIFERT, M. MOHR, S. POSCH, I. GROSSE
A Java framework for statistical analysis and classification of biological sequences. http://www.jstacs.de
2010
SINHA, R., T. LENSER, N. JAHN, U. GAUSMANN, S. FRIEDEL, K. SZAFRANSKI, K. HUSE, P. ROSENSTIEL, J. HAMPE, S. SCHUSTER, M. HILLER, R. BACKOFEN, M. PLATZER
TassDB2 - A comprehensive database of subtle alternative splicing events. BMC Bioinformatics 211, 216. PMID: 20429909.
SINHA, R., T. LENSER, N. JAHN, U. GAUSMANN, S. FRIEDEL, K. SZAFRANSKI, K. HUSE, P. ROSENSTIEL, J. HAMPE, S. SCHUSTER, M. HILLER, R. BACKOFEN, M. PLATZER
M. SEIFERT, A. BANAEI, J. KEILWAGEN, M.F. METTE, A. HOUBEN, F. ROUDIER, V. COLOT, I. GROSSE & M. STRICKERT
Array-based Genome Comparison of Arabidopsis Ecotypes Using Hidden Markov Models. Proc. 2nd International Conference on Bio-inspired Systems and Signal Processing (BIOSIGNALS). pp. 3-11.
MotifAdjuster: a tool for computational reassessment of transcription factor binding site annotations. Genome Biology 10(5) doi:10.1186/gb-2009-10-5-r46.
M. STRICKERT, J. KEILWAGEN, F. -M. SCHLEIF, T. VILLMANN & M. BIEHL
Matrix Metric Adaptation Linear Discriminant Analysis of Biomedical Data. Bio-Inspired Systems: Computational and Ambient Intelligence. Springer Lecture Notes in Computer Science. LNCS 5517. pp. 933-940.
M. STRICKERT, A. SOTO, J. KEILWAGEN & G.E. VAZQUEZ
Towards matrix-based selection of feature pairs for efficient ADMET prediction. Proceedings of the 10th Argentine Symposium on Artificial Intelligence ASAI 2009. pp. 83-94.
J. THIEL, D. WEIER, N. SREENIVASULU, M. STRICKERT, N. WEICHERT, M. MELZER, T. CZAUDERNA, U. WOBUS, H. WEBER & W. WESCHKE
Different hormonal regulation of cellular differentiation and function in nucellar projection and endosperm transfer cells – a microdissection-based transcriptome study of young barley grains. Plant Physiology 148: 1436-1452.
N. SREENIVASULU, B. USADEL, A. WINTER, V. RADCHUK, U. SCHOLZ, N. STEIN, W. WESCHKE, M. STRICKERT, T.J. CLOSE, M. STITT, A. GRANER & U. WOBUS
Barley grain maturation and germination: Metabolic pathway and regulatory network commonalities and differences highlighted by new MapMan/PageMan profiling tools. Plant Physiol. 146: 1738-1758.
M. STRICKERT, K. WITZEL, J. KEILWAGEN, H.-P. MOCK, P. SCHNEIDER & M. BIEHL
Adaptive matrix metrics for attribute dependence analysis in differential high-throughput data. Proc. 5th International Workshop on Computational Systems Biology (WCSB), TICSP series 41: 181-184.
M. STRICKERT, N. SREENIVASULU, T. VILLMANN & B. HAMMER
Robust centroid-based clustering using derivatives of Pearson correlation. Proc. International Conference on Biomedical Engineering Systems and Technologies. INSTICC Publications: 197-203.
VILLMANN, T., M. STRICKERT, C. BRÜß, F.-M. SCHLEIF & U. SEIFFERT
Visualization of Fuzzy Information in Fuzzy-Classification for Image Segmentation using MDS. Proc. of the European Symp. on Artificial Neural Networks (ESANN). D-Side publishers Evere/Belgium, pp. 103-107.
2007
HAMMER, B., A. HASENFUß, F.-M. SCHLEIF, T. VILLMANN & M. STRICKERT
Intuitive Clustering of Biological Data.
Proc. of the International Joint Conference on Artificial Neural Networks (IJCNN), ISSN 1-4244-1380-X.
2007
HAMMER, B., A. HASENFUß, F. ROSSI & M. STRICKERT
Topographic Processing of Relational Data.
Online proceedings of the International Workshop on Self-Organizing Maps (WSOM), ISBN 978-3-00-022473-7.
Gradients of Pearson Correlation for the Analysis of Biomedical Data.
Proc. of the Argentine Symp. on Artificial Intelligence (ASAI), pp. 139-150, ISSN 1850-2784.
2007
STRICKERT, M., K. WITZEL, H.-P. MOCK, F.-M. SCHLEIF & T. VILLMANN
Supervised Attribute Relevance Determination for Protein Identification in Stress Experiments. Proceedings of Machine Learning in Systems Biology (MLSB).
2007
HAMMER, B., A. HASENFUß, F. ROSSI & M. STRICKERT
Topographic Processing of Relational Data. The 6th International Workshop on Self-Organizing Maps (WSOM), ISBN 978-3-00-022473-7
VILLMANN, T., F.-M. SCHLEIF, E. MERENYI, M. STRICKERT & B. HAMMER
Class imaging of hyperspectral satellite remote sensing data using FLSOM. The 6th International Workshop on Self-Organizing Maps (WSOM), ISBN 978-3-00-022473-7