Connecting Librarian

Connecting new ideas and technologies with library service

Feb 11

VALA2010 Concurrent Session 7 – Innovation

  • By Michelle McLean in conference

Warwick Cathro and Susan Collier – Developing Trove: the policy and technical challenges

Trove is a free discovery service for the public. It allows them to discover and annotate content, and is aimed at both the casual user and the researcher. It is part of Australian infrastructure, not a purchased product. It's all of the NLA's services rolled into one, with more added.

Two imperatives for the NLA: to streamline and integrate the proliferation of national collection discovery tools and, as per their Direction Statement, to develop online spaces for user interaction.

The name Trove comes from "treasure trove", where "trove" comes from the French for "found", so it combines the content and the finding of it.

It benefits from their experiences with Libraries Australia, Pandora, ARO and more.

A small team of five developed Trove. They started in September 2008, had a prototype in May 2009, went through nine versions of the prototype and released version 1.0 in November 2009. There have been three updates since then.

Challenges: collection views, works and versions, what is online?

Collection views: search results are grouped into collection views, so they needed to decide what those would be. Newspapers and people were easy; the rest were not. They realised that they were working from a library view, so they recruited a group of students, teachers, family historians and members of the general public to card sort the different types of material into groups and name the groups. They then used the group names to get people to put types into them. The resulting groups included: books, journals etc.; pictures and photos; Australian newspapers; diaries and letters; and much more.

Creating metadata for these groups was very difficult. Rules are not perfect, so they know that there are items which are in the wrong groups. Hopefully in future, users will be able to suggest alternatives.

Trove is FRBRish: it has a similar structure, with some variations. Trove takes old MARC records and makes them do new things.

There are issues with determining online access. It is easy to discover that a resource is online, but hard to discover what the item is and whether access is free. Three types were identified: available online, available online (access condition), possibly online.

Want users to add value – they can tag, split and merge records, fix the OCR on the newspapers. Enhancements are included in a separate layer. It improves the quality, as evidenced by the Australian newspapers project.
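
As a rough illustration of what keeping enhancements in a separate layer could look like, here is a minimal Python sketch. It is not Trove's actual implementation: the class names, fields and the sample article are all hypothetical. The point is only that the source record stays untouched while tags and OCR fixes are overlaid at display time.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class SourceRecord:
    """Original record as supplied; frozen so it is never edited in place."""
    record_id: str
    ocr_lines: tuple


@dataclass
class UserLayer:
    """User contributions, stored apart from the source records (hypothetical)."""
    tags: dict = field(default_factory=dict)         # record_id -> set of tags
    corrections: dict = field(default_factory=dict)  # (record_id, line_no) -> text

    def add_tag(self, record_id, tag):
        self.tags.setdefault(record_id, set()).add(tag)

    def correct_line(self, record_id, line_no, text):
        self.corrections[(record_id, line_no)] = text


def display_lines(record, layer):
    """Overlay user corrections on the OCR text without changing the source."""
    return [
        layer.corrections.get((record.record_id, line_no), line)
        for line_no, line in enumerate(record.ocr_lines)
    ]


# Hypothetical newspaper article with one OCR error in the first line.
article = SourceRecord("article-1", ("Tbe Prime Minister arrived", "in Canberra yesterday."))
layer = UserLayer()
layer.correct_line("article-1", 0, "The Prime Minister arrived")
layer.add_tag("article-1", "prime ministers")

corrected = display_lines(article, layer)
```

Because the corrections live in their own structure, a bad edit can be dropped without touching the original OCR text.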

They can monitor what users are doing online, in terms of interaction with the content. Comments have been added to Trove by users. For example, a photo had a comment from the person's grandmother, giving more biographical detail; newspapers have been corrected and more information provided.

Future developments: currently working on RSS feeds, enhanced sorting, more external targets, more full text, an API. Then – search and delivery of NLA digitised journals, inclusion of journal article indexing data from partner vendors, more goals for obtaining data from archives and museums.

Trove release comes after three years of discussion and development. Takes resource discovery to a new level. There are other products out there that will do the same. Trove is different, includes more unique content and is national.

Paul Hagon – Everything I know about cataloguing I learned from watching James Bond.

Senior web person at the secret society of librarians at Canberra – also known as NLA.

Newspapers used to be papers in metal filing drawers, all carefully labelled with metadata, then fed into a microfilm reader. Services like Trove allow discovery down to deep content, and the metadata has been relegated to the rear. Content now rocks.

All full-text searching of the newspapers is made possible through OCR. Deep content searching is possible with text, but what about images? Computers are good at identifying mathematical markers within images, so he began with facial recognition: can we use this on our collections on a global scale?

He chose a series of photos of a range of Australian Prime Ministers and ran them through iPhoto. It was a laborious process and it didn't do too well at identifying people accurately: 32%. OpenCV, from Intel, was tried next. It didn't try to identify people, just to identify a face, and when it found one it boxed it. It was very successful in identifying two photos of the same person, regardless of context. It didn't do so well with people in profile or with poor-quality images. It was successful 85% of the time.

What could it be used for? If you do a search on Parks, you get people, the town and the feature. If you click on portraits, you would get images as well.

He also did work on colours, breaking images down into colours and recording both the colours and the percentage of the image that had each colour. Some colours can be lost, however, when there is not enough of them in the image to show up. You can go up to 64 colours (from 8) to pick those up, but then data storage requirements grow dramatically.
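
To make the 8-versus-64 colour trade-off concrete, here is a minimal Python sketch. It assumes a simple uniform split of each RGB channel (2 levels per channel gives 8 colours, 4 levels gives 64); the talk did not say how the palette was actually built, and the pixel values are made up.

```python
from collections import Counter


def quantise(pixel, levels):
    """Snap each 0-255 channel to one of `levels` evenly sized buckets."""
    step = 256 // levels
    return tuple(min(channel // step, levels - 1) for channel in pixel)


def colour_shares(pixels, levels):
    """Return {bucket: percentage of pixels} for the given number of levels."""
    counts = Counter(quantise(p, levels) for p in pixels)
    total = len(pixels)
    return {bucket: 100.0 * n / total for bucket, n in counts.items()}


# Hypothetical 10-pixel "image": dark blue, dark green and one orange pixel.
pixels = [(10, 20, 120)] * 6 + [(20, 100, 30)] * 3 + [(250, 140, 30)]

coarse = colour_shares(pixels, levels=2)  # 2**3 = 8 possible colours
fine = colour_shares(pixels, levels=4)    # 4**3 = 64 possible colours
```

With 8 colours the dark blue and dark green pixels fall into the same bucket, so the green is lost; with 64 they are kept apart, at the cost of up to eight times as many values to store per image.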

He did more testing with ImageMagick, which can analyse an image and report its RGB values, which can then be stored in a database. You can then search the database just by colour. You can end up with different types of images depending on which colours you search for.
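
A minimal sketch of searching stored colour values, using Python's built-in sqlite3 module. The table layout, the sample rows and the `search_by_colour` helper are all assumptions for illustration; the talk did not describe the real schema, and no ImageMagick call is shown here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE image_colours "
    "(image_id TEXT, r INTEGER, g INTEGER, b INTEGER, share REAL)"
)
# Hypothetical dominant colours per image, with the share of the image in percent.
rows = [
    ("beach.jpg", 30, 120, 200, 55.0),
    ("beach.jpg", 230, 210, 160, 40.0),
    ("forest.jpg", 20, 110, 40, 70.0),
    ("sunset.jpg", 240, 120, 30, 60.0),
]
conn.executemany("INSERT INTO image_colours VALUES (?, ?, ?, ?, ?)", rows)


def search_by_colour(conn, rgb, tolerance=40, min_share=10.0):
    """Return image ids with a stored colour close to `rgb`, largest share first."""
    r, g, b = rgb
    query = """
        SELECT image_id, MAX(share) AS best_share
        FROM image_colours
        WHERE ABS(r - ?) <= ? AND ABS(g - ?) <= ? AND ABS(b - ?) <= ?
          AND share >= ?
        GROUP BY image_id
        ORDER BY best_share DESC
    """
    params = (r, tolerance, g, tolerance, b, tolerance, min_share)
    return [row[0] for row in conn.execute(query, params)]


blue_matches = search_by_colour(conn, (40, 130, 210))
orange_matches = search_by_colour(conn, (250, 130, 40))
```

Searching for a blue returns the beach photo and searching for an orange returns the sunset, which is the "different types of images depending on colour" effect in miniature.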

http://1104.nla.gov.au: go and play, and give feedback to Paul.

Why research? Computer applications are already using this technology. On the iPhone, the Shazam app identifies music that is being played and gives you more info about it. The Etsy craft store lets you search by colour. With Google Goggles, you take a photo and it analyses a feature and brings back info on it. All of these rely on pattern recognition in an item, no metadata required.

  • discovery layers, metadata, VALA2010

Michelle McLean

Part-time librarian, full-time wife and mother, who loves working in a public library and playing with virtual services and new technologies.


© 2022 Connecting Librarian.
