Similar Articles |
|
D-Lib March 2006 Gregory Crane |
What Do You Do with a Million Books? The ability to extract from the stored record of humanity useful information in an actionable format for any given human being of any culture at any time and in any place will not emerge quickly, but the fundamental tools on which such a system would be built are moving forward. |
D-Lib March 2006 Schibel & Rydberg-Cox |
Early Modern Culture in a Comprehensive Digital Library Digital libraries have the potential to transform fields such as early modern studies, where problems of physical access to sources and intellectual access to their contents have hampered our ability to contemplate major topics. |
D-Lib Sep/Oct 2011 Gauthereau-Bryson et al. |
Digitization Practices for Translations: Lessons Learned from the Our Americas Archive Partnership Project This paper discusses the complexities involved in digitizing multilingual historical documents, including practices for creating "born-digital" translations and unique metadata to best describe these rare, primary documents. |
D-Lib Mar/Apr 2014 George V. Landon |
Report on the 2nd International Workshop on Historical Document Imaging and Processing (HIP'13) Technical areas covered in the workshop included information extraction and retrieval; reconstruction and degradation; text and image recognition and segmentation; and layout analysis and databases. |
D-Lib Mar/Apr 2009 George V. Landon |
Toward Digitizing All Forms of Documentation Techniques to digitize numerous forms of documentation, including deteriorated manuscripts and photography. |
D-Lib Jul/Aug 2015 Lorang et al. |
Developing an Image-Based Classifier for Detecting Poetic Content in Historic Newspaper Collections The Image Analysis for Archival Discovery (Aida) project team is investigating the use of image analysis to identify poetic content in historic newspapers. |
D-Lib Nov/Dec 2015 Francopoulo et al. |
NLP4NLP: The Cobbler's Children Won't Go Unshod Understanding current trends is a challenging and attractive text mining task, especially when suitable tools are recursively applied to publications from the very domain they come from. |
D-Lib Jul/Aug 2012 Bertin & Atanassova |
Semantic Enrichment of Scientific Publications and Metadata Our aim is to bring new value to scientific publications by automatic extraction and semantic analysis. |
D-Lib February 2001 G. Sayeed Choudhury |
Strike Up the Score Deriving Searchable and Playable Digital Formats from Sheet Music... |
D-Lib August 2009 Tanner et al. |
Measuring Mass Text Digitization Quality and Usefulness Lessons Learned from Assessing the OCR Accuracy of the British Library's 19th Century Online Newspaper Archive |
D-Lib Jul/Aug 2000 Gregory Crane |
Designing Documents to Enhance the Performance of Digital Libraries: Time, Space, People and a Digital Library on London In a mature digital library (DL), documents should coexist with a Geographic Information System (GIS). |
D-Lib Nov/Dec 2015 Tkaczyk et al. |
Structured Affiliations Extraction from Scientific Literature CERMINE is a comprehensive open source system for extracting structured metadata from scientific articles in a born-digital form. |
D-Lib February 2008 Edwin Klijn |
The Current State-of-Art in Newspaper Digitization: A Market Perspective The market and technology adjust to accommodate the trend of newspaper collection digitization by libraries. |
D-Lib Jul/Aug 2012 Herrmannova & Knoth |
Visual Search for Supporting Content Exploration in Large Document Collections Users now demand better support for exploring document collections to discover connections, compare and contrast information. |
D-Lib Jul/Aug 2014 DeRidder & Matheny |
What Do Researchers Need? Feedback On Use of Online Primary Source Materials A qualitative study of 11 humanities faculty researchers at the University of Alabama, describes and rates the importance of various issues encountered when using 29 participant-selected online databases. |
D-Lib Nov/Dec 2014 Klampfl et al. |
A Comparison of Two Unsupervised Table Recognition Methods from Digital Scientific Articles In this paper we present two table recognition methods based on unsupervised learning techniques and heuristics which automatically detect both the location and the structure of tables within a article stored as PDF. |
Information Today September 2, 2010 |
IBM and the EU Collaborate on Digitization of Historic European Texts The project seeks to provide technology that will enable highly-accurate digitization of rare and culturally significant historical texts on a massive scale. |
D-Lib December 2002 Dagobert Soergel |
A Framework for Digital Library Research: Broadening the Vision Digital library research and development needs a framework that can be used as a perspective on existing research and practice and, more importantly, as a structured vision for the development of new ideas. |
D-Lib Mar/Apr 2009 Rose Holley |
How Good Can It Get? Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitization Programs This article details the work undertaken by the National Library of Australia Newspaper Digitisation Program on identifying and testing solutions to improve OCR accuracy in large scale newspaper digitisation programs. |
D-Lib Nov/Dec 2015 Frey & Kern |
Efficient Table Annotation for Digital Articles Table recognition and table extraction are important tasks in information extraction, especially in the domain of scholarly communication. |
D-Lib January 2002 Suzana Sukovic |
Beyond the scriptorium: The Role of the Library in Text Encoding Development of electronic textual resources means dealing with documents in new ways and on different levels, often involving work on a document's content through text encoding. This development challenges the library's assumed position in the research process... |
D-Lib January 2005 |
In Brief The Gamera Software Development Kit for Document Image Analysis... XML Techniques for the Representation and Interchange of Thesaurus Data... The UTOPIA Project... In the News... etc. |
D-Lib March 2006 David A. Smith |
Debabelizing Libraries: Machine Translation by and for Digital Collections Million-book libraries provide not only testbeds for existing ideas, but also several problems in need of immediate solution. As data acquisition becomes more automated, cataloguing needs more automated help. |
D-Lib Nov/Dec 2009 Cassella & Calvi |
ECDL 2009 Enhancing digital libraries users' experience. |
D-Lib March 2006 Daniel J. Cohen |
From Babel to Knowledge: Data Mining Large Digital Collections High-quality digitization and thorough text markup may be attractive for those creating digital collections, but a familiarity with information theory and data-mining techniques makes one realize that it may be more worthwhile to digitize a greater number of books or documents at a lower standard for the same cost. |
D-Lib May/Jun 2012 Westbrook et al. |
Metadata Clean Sweep: A Digital Library Audit Project This paper discusses the pilot of an ongoing digital library metadata audit project that was collaboratively launched by library school interns and full-time staff to alleviate poor recall, poor precision and metadata inconsistencies across digital collections. |
D-Lib January 2000 Dan Huttenlocher & Angela Moll |
On DigiPaper and the Dissemination of Electronic Documents Proposal for a new image-based document representation, called DigiPaper, which is designed to easily disseminate electronic documents with a guaranteed appearance. DigiPaper's compression performance is analyzed. |
Information Today November 6, 2008 |
Automatic Categorization Added in Hot Neuron's Clustify 2.0 Hot Neuron, LLC announced the release of version 2.0 of its Clustify document clustering software, which features automatic document categorization and other tools to help corporations and law firms explore and organize large document sets. |
PC Magazine February 8, 2008 John C. Dvorak |
Computing's Final Frontiers The ultimate in machine translation is the gadget that translates what you say and speaks it in a foreign language. I am certain that the smart money has long since bailed out of these projects. |
D-Lib October 2001 Ian H. Witten |
Greenstone: Open-Source Digital Library Software The Greenstone digital library software is an open-source system for the construction and presentation of information collections. It builds collections with effective full-text searching and metadata-based browsing facilities that are attractive and easy to use... |
D-Lib March 2001 |
In Brief Award to Penn State University Libraries to support an extensive study of digital image delivery... Digitization of Printed Material: The METAe Project... The Special Collections Virtual Reading Room... etc. |
D-Lib March 2002 |
Clips and Pointers Executive Summary of the DigiCULT Study Technological Landscapes for Tomorrow's Cultural Economy... Conclusions from the Text-e Virtual Symposium... Point to Point... Calls for Participation... etc. |
D-Lib September 2002 Schmidt et al. |
Building Digital Tobacco Industry Document Libraries Few digital libraries begin with the drama that accompanied creation of the University of California San Francisco's (UCSF) Tobacco Control Archives. UCSF's extensive digital collections of tobacco industry documents began in 1993 with an anonymous donation of documents. |
Information Today June 27, 2011 Barbara Quint |
The British Library Joins Google Books Google Books continues its march through the national libraries of Europe with the announcement of a deal with the British Library. |
PC World August 23, 2006 Richard Jantz |
IRIS Serves Up Snappy, Accurate OCR Update of optical character recognition program could boost your office's productivity. |
InternetNews February 5, 2009 Judy Mottl |
Google Mobilizes Its Book Collection Google takes initial step to bringing its library to mobile readership. |
D-Lib February 2001 Manfred Thaller |
From the Digitized to the Digital Library Many, if not most, digitization projects have aimed at existing collections as individual servers. A digital library, however, should be more than a digitized one... |
D-Lib Jul/Aug 2000 Thomas A. Phelps & Robert Wilensky |
Robust Hyperlinks and Locations We suggest that building "permissive, but robust" digital library systems and services is an attractive alternative to the library and computer science tradition of building "strict, but fragile" systems. |
PC Magazine September 29, 2005 M. David Stone |
OmniPage Professional 15 ScanSoft's OmniPage Professional 15 takes OCR programs into new territory. |
Macworld March 2004 Christopher Breen |
Readiris Pro 9 OCR Application Offers Improved Accuracy, Has Some Quirks |
Information Today March 3, 2008 |
Hot Neuron Introduces Document Clustering Software Hot Neuron announced the release of version 1.0 of its Clustify document clustering software, aimed at helping corporations and law firms explore, organize, and tag large document sets. |
Macworld August 17, 2007 Jeffery Battersby |
Pages '08 Apple's new Pages '08 is a very good word processing and page layout program with dozens of new features and enhancements, but, it is still missing key features -- and has one bug that may prevent you from taking the plunge. |
Information Today May 23, 2011 |
National Library of France Embarks on Huge Digitization Project The BnF, the National Library of France, has signed a new deal with the Jouve-Safig-Diadeis partnership for the digitization of its print collections. |
Information Today November 8, 2012 |
ProQuest Participates in Early Modern OCR Project The company will provide access to page images from the veritable Early English Books Online and newcomer Early European Books to the Early Modern OCR Project (eMOP) at Texas A&M. |
PC Magazine September 13, 2006 M. David Stone |
Buying Guide: Document Scanners One of the benefits of creating text-based documents on a computer is that it's easy to find the documents again. |
Financial Advisor March 2010 David Lawrence |
A Key To Efficiency Advisors today have many choices when it comes to document management software. |
Financial Advisor February 2010 Joel P. Bruckenstein |
One For The Short List Document management system Image Executive allows advisors to operate more productively, efficiently and securely. |
PC Magazine June 2, 2008 Neil J. Rubenking |
Eight Handy Tools in Microsoft Word You Probably Don't Know About Save time and energy by using these easy features in your Word documents. |
CIO January 15, 2002 Simone Kaplan |
Management by Any Other Name? The line between the document management and content management markets is growing fuzzier by the day... |
PC Magazine November 14, 2007 Neil J. Rubenking |
Stripping Out Metadata in Word Want to remove your name and other personal info from a Word document? Here's a simple trick to do just that. |