Robert H. McDonald edited section_Distant_Publishing_Use_Cases__.tex  almost 8 years ago

Commit id: d96cb7fd8f9e200a0432daed100ebdccef17bad5

deletions | additions      

       

\begin{itemize}  \item \textbf{Extracted Features Worksets} - HTRC expects this concept to be further refined as we move toward the second round of HTRC Advanced Collaborative Support grants which will be funded in summer 2016. Our most progressive case for distant publishing at this point is leveraged through the publication and release of our main extracted features workset. The current workset is a prototype based on the 4.8 million volume public domain collection from the HTDL. In the coming months this workset will be redefined to include more of the HTDL collection. From this initial workset publication we have seen further refinements of the workset by scholars such as Ted Underwood, Colin Allen, and Matthew Wilkens.  \item \textbf{HT+Bookworm} - The HathiTrust+Bookworm (HT+BW) project presents textual content through interactive visualization. Whereas HT+BW has previously been used in standalone contexts with pre-determined metadata, currently HT+BW is enabling scholars to analyze custom personal collections from within the larger corpus and the use of HT+BW as a supplement to other uses of the HTRC. This concept could eventually become a new possibility for derived workset publication in its own right.  \item \textbf{HTRC Workset Ontology} - Currently in development, the HTRC Workset Ontology is part of a collections data model by the Workset Creation for Scholarly Analysis project , a HTRC research initiative funded by the Andrew W. Mellon Foundation. The resulting HTRC Workset data model is designed to aid humanities scholars by helping them to describe selected portions of the HTDL corpus that serve as the objects of their research. The resulting worksets are persistent, citable, and can be assessed by other scholars for reuse in additional research processes.  \end{itemize}