Description : This book provides an overview of the state of the art in research and development of systems for document image analysis. Topics covered include a variety of systems and architectures for processing document images as well as methods for converting those images into formats that can be manipulated by a computer. The chapters are written by recognized experts in the field and describe Systems and Architectures, Recognition Techniques, Graphics Analysis, Document Image Retrieval, and World Wide Web Applications.
Description : Thisvolumecontainspapersselectedforpresentationatthe6thIAPRWorkshop on Document Analysis Systems (DAS 2004) held during September 8–10, 2004 at the University of Florence, Italy. Several papers represent the state of the art in a broad range of “traditional” topics such as layout analysis, applications to graphics recognition, and handwritten documents. Other contributions address the description of complete working systems, which is one of the strengths of this workshop. Some papers extend the application domains to other media, like the processing of Internet documents. The peculiarity of this 6th workshop was the large number of papers related to digital libraries and to the processing of historical documents, a taste which frequently requires the analysis of color documents. A total of 17 papers are associated with these topics, whereas two yearsago (in DAS 2002) only a couple of papers dealt with these problems. In our view there are three main reasons for this new wave in the DAS community. From the scienti?c point of view, several research ?elds reached a thorough knowledge of techniques and problems that can be e?ectively solved, and this expertise can now be applied to new domains. Another incentive has been provided by several research projects funded by the EC and the NSF on topics related to digital libraries.
Description : Recently, there has been an increased interest in the research and development of techniques for components of complete document analysis systems. In recognition of this trend, a series of workshops on Document Analysis Systems commenced in 1994, under the leadership of Henry Baird. The first workshop, held in Kaiserslautern, Germany, in October, 1994, was chaired by Andreas Dengel and Larry Spitz. The second workshop on Document Analysis Systems was held in Malvern, PA, USA, in October, 1996, chaired by Jonathan J. Hull and Suzanne Liebowitz Taylor. The DAS workshop has been one of the most prestigious technical meetings, bringing together a large number of scientists and engineers from all over the world to express their innovative ideas and report on their latest achievements in the area of document analysis systems. The papers in this special book edition were rigorously selected from the Third IAPR Workshop on Document Analysis Systems (DAS’98), held in Nagano, Japan, on 4 - 6 November 1998. It is worth mentioning that the papers were chosen for their original and substantial contributions to the workshop theme and this special book edition. From among the 53 papers that were presented by authors from 11 countries at the DAS’98 after critical reviews by at least three experts, we carefully selected 29 papers for this special book edition. Most of the contributions in this edition have been expanded or extensively revised to include helpful discussions, suggestions, or comments made during the workshop.
Description : This book constitutes the refereed proceedings of the 5th International Workshop on Document Analysis Systems, DAS 2002, held in Princeton, NJ, USA in August 2002 with sponsorship from IAPR.The 44 revised full papers presented together with 14 short papers were carefuly reviwed and selected for inclusion in the book. All current issues in document analysis systems are adressed. The papers are organized in topical sections on OCR features and systems, handwriting recognition, layout analysis, classifiers and learning, tables and forms, text extraction, indexing and retrieval, document engineering, and new applications.
Description : This book constitutes the refereed proceedings of the 7th International Conference on Document Analysis Systems, DAS 2006, held in Nelson, New Zealand, in February 2006. The 33 revised full papers and 22 poster papers presented were carefully reviewed and selected from 78 submissions. The papers are organized in topical sections on digital libraries, image processing, handwriting, document structure and format, tables, language and script identification, systems and performance evaluation, and retrieval and segmentation.
Description : The compendium presents the latest results of the most prominent competitions held in the field of Document Analysis and d104 Recognition. It includes a description of the participating systems and the underlying methods on one hand and the datasets used together with evaluation metrics on the other hand. This volume also demonstrates with examples, how to organize a competition and how to make it successful. It will be an indispensable handbook to the document image analysis community.
Description : This book constitutes the refereed proceedings of the First International Workshop on Multiple Classifier Systems, MCS 2000, held in Cagliari, Italy in June 2000. The 33 revised full papers presented together with five invited papers were carefully reviewed and selected for inclusion in the book. The papers are organized in topical sections on theoretical issues, multiple classifier fusion, bagging and boosting, design of multiple classifier systems, applications of multiple classifier systems, document analysis, and miscellaneous applications.
Description : Basic Concepts of Document Analysis and Understanding; Basic Concepts of Fractal Dimension; Basic Concepts of Wavelet Theory; Document Analysis by Fractal Dimension; Text Extraction by Wavelet Decomposition; Rotation Invariant by Fractal Theory with Central Projection Transform (CPT); Wavelet-Based and Fractal-Based Methods for Script Identification; Writer Identification Using Hidden Markov Model in Wavelet Domain (WD-HMM).
Description : The objective of Document Analysis and Recognition (DAR) is to recognize the text and graphicalcomponents of a document and to extract information. With ?rst papers dating back to the 1960’s, DAR is a mature but still gr- ing research?eld with consolidated and known techniques. Optical Character Recognition (OCR) engines are some of the most widely recognized pr- ucts of the research in this ?eld, while broader DAR techniques are nowadays studied and applied to other industrial and o?ce automation systems. In the machine learning community, one of the most widely known - search problems addressed in DAR is recognition of unconstrained handwr- ten characters which has been frequently used in the past as a benchmark for evaluating machine learning algorithms, especially supervised classi?ers. However, developing a DAR system is a complex engineering task that involves the integration of multiple techniques into an organic framework. A reader may feel that the use of machine learning algorithms is not approp- ate for other DAR tasks than character recognition. On the contrary, such algorithms have been massively used for nearly all the tasks in DAR. With large emphasis being devoted to character recognition and word recognition, other tasks such as pre-processing, layout analysis, character segmentation, and signature veri?cation have also bene?ted much from machine learning algorithms.