MagPortal.com
Similar Articles
D-Lib
March 2006
Gregory Crane
What Do You Do with a Million Books? The ability to extract from the stored record of humanity useful information in an actionable format for any given human being of any culture at any time and in any place will not emerge quickly, but the fundamental tools on which such a system would be built are moving forward.
D-Lib
March 2006
Schibel & Rydberg-Cox
Early Modern Culture in a Comprehensive Digital Library Digital libraries have the potential to transform fields such as early modern studies, where problems of physical access to sources and intellectual access to their contents have hampered our ability to contemplate major topics.
D-Lib
Sep/Oct 2011
Gauthereau-Bryson et al.
Digitization Practices for Translations: Lessons Learned from the Our Americas Archive Partnership Project This paper discusses the complexities involved in digitizing multilingual historical documents, including practices for creating "born-digital" translations and unique metadata to best describe these rare, primary documents.
D-Lib
Mar/Apr 2014
George V. Landon
Report on the 2nd International Workshop on Historical Document Imaging and Processing (HIP'13) Technical areas covered in the workshop included information extraction and retrieval; reconstruction and degradation; text and image recognition and segmentation; and layout analysis and databases.
D-Lib
Mar/Apr 2009
George V. Landon
Toward Digitizing All Forms of Documentation Techniques to digitize numerous forms of documentation, including deteriorated manuscripts and photography.
D-Lib
Jul/Aug 2015
Lorang et al.
Developing an Image-Based Classifier for Detecting Poetic Content in Historic Newspaper Collections The Image Analysis for Archival Discovery (Aida) project team is investigating the use of image analysis to identify poetic content in historic newspapers.
D-Lib
Nov/Dec 2015
Francopoulo et al.
NLP4NLP: The Cobbler's Children Won't Go Unshod Understanding current trends is a challenging and attractive text mining task, especially when suitable tools are recursively applied to publications from the very domain they come from.
D-Lib
Jul/Aug 2012
Bertin & Atanassova
Semantic Enrichment of Scientific Publications and Metadata Our aim is to bring new value to scientific publications by automatic extraction and semantic analysis.
D-Lib
February 2001
G. Sayeed Choudhury
Strike Up the Score Deriving Searchable and Playable Digital Formats from Sheet Music...
D-Lib
August 2009
Tanner et al.
Measuring Mass Text Digitization Quality and Usefulness Lessons Learned from Assessing the OCR Accuracy of the British Library's 19th Century Online Newspaper Archive
D-Lib
Jul/Aug 2000
Gregory Crane
Designing Documents to Enhance the Performance of Digital Libraries: Time, Space, People and a Digital Library on London In a mature digital library (DL), documents should coexist with a Geographic Information System (GIS).
D-Lib
Nov/Dec 2015
Tkaczyk et al.
Structured Affiliations Extraction from Scientific Literature CERMINE is a comprehensive open source system for extracting structured metadata from scientific articles in a born-digital form.
D-Lib
February 2008
Edwin Klijn
The Current State-of-Art in Newspaper Digitization: A Market Perspective The market and technology adjust to accommodate the trend of newspaper collection digitization by libraries.
D-Lib
Jul/Aug 2012
Herrmannova & Knoth
Visual Search for Supporting Content Exploration in Large Document Collections Users now demand better support for exploring document collections to discover connections, compare and contrast information.
D-Lib
Jul/Aug 2014
DeRidder & Matheny
What Do Researchers Need? Feedback On Use of Online Primary Source Materials A qualitative study of 11 humanities faculty researchers at the University of Alabama describes and rates the importance of various issues encountered when using 29 participant-selected online databases.
D-Lib
Nov/Dec 2014
Klampfl et al.
A Comparison of Two Unsupervised Table Recognition Methods from Digital Scientific Articles In this paper we present two table recognition methods based on unsupervised learning techniques and heuristics which automatically detect both the location and the structure of tables within an article stored as PDF.
Information Today
September 2, 2010
IBM and the EU Collaborate on Digitization of Historic European Texts The project seeks to provide technology that will enable highly-accurate digitization of rare and culturally significant historical texts on a massive scale.
D-Lib
December 2002
Dagobert Soergel
A Framework for Digital Library Research: Broadening the Vision Digital library research and development needs a framework that can be used as a perspective on existing research and practice and, more importantly, as a structured vision for the development of new ideas.
D-Lib
Mar/Apr 2009
Rose Holley
How Good Can It Get? Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitization Programs This article details the work undertaken by the National Library of Australia Newspaper Digitisation Program on identifying and testing solutions to improve OCR accuracy in large scale newspaper digitisation programs.
D-Lib
Nov/Dec 2015
Frey & Kern
Efficient Table Annotation for Digital Articles Table recognition and table extraction are important tasks in information extraction, especially in the domain of scholarly communication.
D-Lib
January 2002
Suzana Sukovic
Beyond the scriptorium: The Role of the Library in Text Encoding Development of electronic textual resources means dealing with documents in new ways and on different levels, often involving work on a document's content through text encoding. This development challenges the library's assumed position in the research process...
D-Lib
January 2005
In Brief The Gamera Software Development Kit for Document Image Analysis... XML Techniques for the Representation and Interchange of Thesaurus Data... The UTOPIA Project... In the News... etc.
D-Lib
March 2006
David A. Smith
Debabelizing Libraries: Machine Translation by and for Digital Collections Million-book libraries provide not only testbeds for existing ideas, but also several problems in need of immediate solution. As data acquisition becomes more automated, cataloguing needs more automated help.
D-Lib
Nov/Dec 2009
Cassella & Calvi
ECDL 2009 Enhancing digital libraries users' experience.
D-Lib
March 2006
Daniel J. Cohen
From Babel to Knowledge: Data Mining Large Digital Collections High-quality digitization and thorough text markup may be attractive for those creating digital collections, but a familiarity with information theory and data-mining techniques makes one realize that it may be more worthwhile to digitize a greater number of books or documents at a lower standard for the same cost.
D-Lib
May/Jun 2012
Westbrook et al.
Metadata Clean Sweep: A Digital Library Audit Project This paper discusses the pilot of an ongoing digital library metadata audit project that was collaboratively launched by library school interns and full-time staff to alleviate poor recall, poor precision and metadata inconsistencies across digital collections.
D-Lib
January 2000
Dan Huttenlocher & Angela Moll
On DigiPaper and the Dissemination of Electronic Documents Proposal for a new image-based document representation, called DigiPaper, which is designed to easily disseminate electronic documents with a guaranteed appearance. DigiPaper's compression performance is analyzed.
Information Today
November 6, 2008
Automatic Categorization Added in Hot Neuron's Clustify 2.0 Hot Neuron, LLC announced the release of version 2.0 of its Clustify document clustering software, which features automatic document categorization and other tools to help corporations and law firms explore and organize large document sets.
PC Magazine
February 8, 2008
John C. Dvorak
Computing's Final Frontiers The ultimate in machine translation is the gadget that translates what you say and speaks it in a foreign language. I am certain that the smart money has long since bailed out of these projects.
D-Lib
October 2001
Ian H. Witten
Greenstone: Open-Source Digital Library Software The Greenstone digital library software is an open-source system for the construction and presentation of information collections. It builds collections with effective full-text searching and metadata-based browsing facilities that are attractive and easy to use...
D-Lib
March 2001
In Brief Award to Penn State University Libraries to support an extensive study of digital image delivery... Digitization of Printed Material: The METAe Project... The Special Collections Virtual Reading Room... etc.
D-Lib
March 2002
Clips and Pointers Executive Summary of the DigiCULT Study Technological Landscapes for Tomorrow's Cultural Economy... Conclusions from the Text-e Virtual Symposium... Point to Point... Calls for Participation... etc.
D-Lib
September 2002
Schmidt et al.
Building Digital Tobacco Industry Document Libraries Few digital libraries begin with the drama that accompanied creation of the University of California San Francisco's (UCSF) Tobacco Control Archives. UCSF's extensive digital collections of tobacco industry documents began in 1993 with an anonymous donation of documents.
Information Today
June 27, 2011
Barbara Quint
The British Library Joins Google Books Google Books continues its march through the national libraries of Europe with the announcement of a deal with the British Library.
PC World
August 23, 2006
Richard Jantz
IRIS Serves Up Snappy, Accurate OCR Update of optical character recognition program could boost your office's productivity.
InternetNews
February 5, 2009
Judy Mottl
Google Mobilizes Its Book Collection Google takes initial step to bringing its library to mobile readership.
D-Lib
February 2001
Manfred Thaller
From the Digitized to the Digital Library Many, if not most, digitization projects have aimed at existing collections as individual servers. A digital library, however, should be more than a digitized one...
D-Lib
Jul/Aug 2000
Thomas A. Phelps & Robert Wilensky
Robust Hyperlinks and Locations We suggest that building "permissive, but robust" digital library systems and services is an attractive alternative to the library and computer science tradition of building "strict, but fragile" systems.
PC Magazine
September 29, 2005
M. David Stone
OmniPage Professional 15 ScanSoft's OmniPage Professional 15 takes OCR programs into new territory.
Macworld
March 2004
Christopher Breen
Readiris Pro 9 OCR Application Offers Improved Accuracy, Has Some Quirks
Information Today
March 3, 2008
Hot Neuron Introduces Document Clustering Software Hot Neuron announced the release of version 1.0 of its Clustify document clustering software, aimed at helping corporations and law firms explore, organize, and tag large document sets.
Macworld
August 17, 2007
Jeffery Battersby
Pages '08 Apple's new Pages '08 is a very good word processing and page layout program with dozens of new features and enhancements, but it is still missing key features -- and has one bug that may prevent you from taking the plunge.
Information Today
May 23, 2011
National Library of France Embarks on Huge Digitization Project The BnF, the National Library of France, has signed a new deal with the Jouve-Safig-Diadeis partnership for the digitization of its print collections.
Information Today
November 8, 2012
ProQuest Participates in Early Modern OCR Project The company will provide access to page images from the veritable Early English Books Online and newcomer Early European Books to the Early Modern OCR Project (eMOP) at Texas A&M.
PC Magazine
September 13, 2006
M. David Stone
Buying Guide: Document Scanners One of the benefits of creating text-based documents on a computer is that it's easy to find the documents again.
Financial Advisor
March 2010
David Lawrence
A Key To Efficiency Advisors today have many choices when it comes to document management software.
Financial Advisor
February 2010
Joel P. Bruckenstein
One For The Short List Document management system Image Executive allows advisors to operate more productively, efficiently and securely.
PC Magazine
June 2, 2008
Neil J. Rubenking
Eight Handy Tools in Microsoft Word You Probably Don't Know About Save time and energy by using these easy features in your Word documents.
CIO
January 15, 2002
Simone Kaplan
Management by Any Other Name? The line between the document management and content management markets is growing fuzzier by the day...
PC Magazine
November 14, 2007
Neil J. Rubenking
Stripping Out Metadata in Word Want to remove your name and other personal info from a Word document? Here's a simple trick to do just that.