AHDS Visual Arts reports
search images
explore collections
about ahds visual arts
contact ahds visual arts
fine art project
depositing
guides to good practice
advice
research
learning and teaching
reports
site index
links to copyright and disclaimer and to the AHDS website Copyright and disclaimer Go to AHDS web site
 

VADS information standards review

Tony Gill, Catherine Grout and Louise Smith
20 March 1997

bullet arrow
bullet arrow
bullet arrow
bullet arrow

Introduction

This review of domain-specific information standards has been produced for participants of the 1997 Visual Arts, Museums and Cultural Heritage Metadata Workshop, organised by the Visual Arts Data Service in partnership with the Art, Design, Architecture and Media Information Gateway, the Scottish Cultural Resources Access Network and the Museum Documentation Association. It is part of a series of workshops being organised under the auspices of the Arts and Humanities Data Service and UK Office of Library and Information Networking, and is funded by the Joint Information Systems Committee of the UK's Higher Education Funding Councils.

Background

Information standards development is not an organic process which occurs in a vacuum, or an empirical exercise free from the influence of political and cultural factors—it is usually tied to the perceived needs of the day, and occasionally may even look to the achievements of yesterday. In the cultural field, we are often at pains to stress that our information standards are not technologically-driven, but this is only possible up to a point—frequently it is the development of new technology that creates new opportunities for applying our priceless information, and these in turn drive the need for new standards initiatives.

In order to make these initiatives more manageable, it’s been common practice to focus on a particular view of the available information, for example categorised either by function or by subject. The functional approach is illustrated in the Getty Information Institute’s Object ID Project: Core data attributes that could be agreed by cultural bodies, law enforcement agencies and insurance companies in their endeavours to retrieve stolen artefacts. Initiatives taking the subject-based approach include the Van Eyck Project, the focus of which is flat art, and the five information service providers under the umbrella of the Arts and Humanities Data Service (AHDS). This approach is not without its pitfalls, however, when one considers that the definition of an object (as being archaeological as opposed to fine art, for example) often has nothing to do with the inherent nature of the object itself, but is rather more related to a human viewpoint. It is possible, and indeed often desirable, to have more than one ‘subject view’ of the same object.

The diversity of disciplinary approaches is clearly illustrated in the field of terminology standards, where the emphasis in recent years has shifted to the use of thesauri and other linguistic resources to enhance information retrieval, irrespective of original cataloguing practices. Whilst there is still considerable work to be done in terms of developing terminology resources at the specialist level (at the time of writing the MDA supports terminology working groups focusing on archaeology, ethnography, medicine and transport), there is an equal effort required to ensure that such resources map into an overarching conceptual framework.

SPECTRUM: the UK Museum Documentation Standard has been implemented in a number of museums on a variety of platforms (including manual paper-based systems), thereby fulfilling the documentation requirements of the Museums and Galleries Commission’s Registration Scheme. The strength of SPECTRUM lies in its reinterpretation of documentation as relating to all the functions of the museum, not solely that of cataloguing. However, SPECTRUM is necessarily non-prescriptive in terms of the structure of the information that is captured and stored, which although key to its acceptance and adoption by the community, has inevitable implications for interoperability across diverse systems.

It is also becoming more widely acknowledged that there is a need to handle diverse (and often conflicting) interpretations. After all, in the field of cultural information the ‘right answer’ is often either elusive or manifold. In response, standards professionals are looking to the complex information capture afforded by newer methodologies such as object-oriented modelling and semantic network processing. It is no coincidence that the Documentation Committee for the International Council of Museums (CIDOC) is in the process of translating (a misnomer; rather it is a reworking) its relational data model to an object-oriented one.

The requirement to access distributed data has resulted in a number of projects but few standards; access to distributed databases is none too difficult if they are homogeneous. The current challenge is accessing heterogeneous information resources. The Consortium for the Computer Interchange of Museum Information (CIMI) has been working towards that goal for a number of years, and has implemented the CHIO pilot project utilising SGML and Z39.50, technical standards that are also being employed by the flagship EC project Aquarelle. Clearly, what is of interest to the cultural heritage sector is the potential of these standards if adopted more widely; recent work in Aquarelle has focused on separate DTDs for the so-called ‘built’ and ‘movable’ heritage, and the latter was heavily influenced by the CHIO DTD.

Metadata is a relatively new (and poorly-defined!) term for an old concept—the descriptive information about an object that assists in it’s discovery. It just so happens that the object itself is also information, hence the meta prefix.

The broad aims of the 1997 Edinburgh Workshop are to ascertain the extent to which generic (i.e. not discipline-specific) metadata standards can be applied to the information resources of the visual arts, museums and cultural heritage communities.

Like all standards initiatives, the workshop will be judged a success if it manages to build on the consensus of what’s gone before and feed into the consensus of what comes after. And indeed it is this very process which gives us pointers to the future, where standards feed standards (e.g. MDA Data Standard Þ SPECTRUM), standards feed projects (e.g. CIMI DTD Þ Aquarelle), and standards are embedded in off-the-shelf applications (e.g. SPECTRUM and AAT Þ Multi MIMSY 2000).

But these standards are not an end in themselves; they aim to enable information sharing across boundaries, whether the boundaries are subject-based, geographically-based or linguistically-based. The need for cross-boundary standards, built from a foundation of consensus from the ground up, has never been greater.

About the Authors

Catherine Grout

Louise Smith

Tony Gill

VADS Project Manager Director Former ADAM and VADS Programme Leader
Surrey Institute of Art and Design, University College Museum Documentation Association  
Falkner Road
Farnham
Surrey GU9 7DS
Jupiter House
Station Road
Cambridge CB1 2JD
 
+44 (0)1252 892723 +44 (0)1223 315760  
http://vads.ahds.ac.uk/ lsmith@mdocassn.demon.co.uk  
  http://www.open.gov.uk/mdocassn/  

The Standards

It is worth remembering that few, if any, of the following standards are discrete; many have been based on what’s gone before and/or have been mapped one to the other. It is in this tradition that the VADS metadata workshop will be following; focusing on the specific requirements for metadata standards for the community whilst at the same time ensuring that existing standards feed into any resulting work.

A domain-specific review of relevant standards for networked information discovery

bullet arrow
bullet arrow
bullet arrow
bullet arrow
bullet arrow
bullet arrow
bullet arrow
bullet arrow
bullet arrow
bullet arrow
bullet arrow
bullet arrow
bullet arrow
bullet arrow

Dublin Core and Warwick Framework

The Dublin Metadata Core Element Set, or Dublin Core, is a set of 15 descriptive elements used to provide a simple means of describing networked electronic information resources to aid more effective discovery and retrieval.

The Warwick Framework is a conceptual container architecture that can contain multiple metadata packets.

At the time of writing (March 1997), the draft definition of the 15 element set is as follows:

ELEMENT

DESCRIPTION

TITLE

The name given to the resource by the CREATOR or PUBLISHER.

CREATOR

The person(s) or organization(s) primarily responsible for the intellectual content of the resource. For example, authors in the case of written documents, artists, photographers, or illustrators in the case of visual resources.

SUBJECT

The topic of the resource, or keywords or phrases that describe the subject or content of the resource. The intent of the specification of this element is to promote the use of controlled vocabularies and keywords. This element might well include scheme-qualified classification data (for example, Library of Congress Classification Numbers or Dewey Decimal numbers) or scheme-qualified controlled vocabularies (such as Medical Subject Headings or Art and Architecture Thesaurus descriptors) as well.

DESCRIPTION

A textual description of the content of the resource, including abstracts in the case of document-like objects or content descriptions in the case of visual resources. Future metadata collections might well include computational content description (spectral analysis of a visual resource, for example) that may not be embeddable in current network systems. In such a case this field might contain a link to such a description rather than the description itself.

PUBLISHER

The entity responsible for making the resource available in its present form, such as a publisher, a university department, or a corporate entity. The intent of specifying this field is to identify the entity that provides access to the resource.

CONTRIBUTOR

Person(s) or organization(s) in addition to those specified in the CREATOR element who have made significant intellectual contributions to the resource but whose contribution is secondary to the individuals or entities specified in the CREATOR element (for example, editors, transcribers, illustrators, and convenors).

DATE

The date the resource was made available in its present form. The recommended best practice is an 8 digit number in the form YYYYMMDD as defined by ANSI X3.30-1985. In this scheme, the date element for the day this is written would be 19961203, or December 3, 1996. Many other schema are possible, but if used, they should be identified in an unambiguous manner.

TYPE

The category of the resource, such as home page, novel, poem, working paper, technical report, essay, dictionary. It is expected that RESOURCE TYPE will be chosen from an enumerated list of types.

FORMAT

The data representation of the resource, such as text/html, ASCII, Postscript file, executable application, or JPEG image. The intent of specifying this element is to provide information necessary to allow people or machines to make decisions about the usability of the encoded data (what hardware and software might be required to display or execute it, for example). As with RESOURCE TYPE, FORMAT will be assigned from enumerated lists such as registered Internet Media Types (MIME types). In principal, formats can include physical media such as books, serials, or other non-electronic media.

IDENTIFIER

String or number used to uniquely identify the resource. Examples for networked resources include cs and URNs (when implemented). Other globally-unique identifiers, such as International Standard Book Numbers (ISBN) or other formal names would also be candidates for this element.

SOURCE

The work, either print or electronic, from which this resource is derived, if applicable. For example, an html encoding of a Shakespearean sonnet might identify the paper version of the sonnet from which the electronic version was transcribed.

LANGUAGE

Language(s) of the intellectual content of the resource. Where practical, the content of this field should coincide with the NISO Z39.53 three character codes for written languages.

RELATION

Relationship to other resources. The intent of specifying this element is to provide a means to express relationships among resources that have formal relationships to others, but exist as discrete resources themselves. For example, images in a document, chapters in a book, or items in a collection. A formal specification of RELATION is currently under development. Users and developers should understand that use of this element should be currently considered experimental.

COVERAGE

The spatial locations and temporal duration characteristic of the resource. Formal specification of COVERAGE is currently under development. Users and developers should understand that use of this element should be currently considered experimental.

RIGHTS

The content of this element is intended to be a link (a URL or other suitable URI as appropriate) to a copyright notice, a rights-management statement, or perhaps a server that would provide such information in a dynamic way. The intent of specifying this field is to allow providers a means to associate terms and conditions or copyright statements with a resource or collection of resources. No assumptions should be made by users if such a field is empty or not present.


The 15 elements and their meanings have been developed and refined by a group of librarians, information and subject specialists, through an ongoing consensus-building process that has included 4 international workshops to date and the active Meta2 mailing list.

The first workshop, organised by OCLC and NCSA in March 1995 at OCLC’s headquarters in Dublin, Ohio, saw participants arriving at a consensus on the need for a simple Internet resource description record to aid information discovery. It was agreed that the record needed multidisciplinary support, and should also be very simple so that useful descriptions could be created by information providers without the need for training. The result was agreement on a set of (then) 13 elements for describing ‘Document-Like Objects’, all of them ‘repeatable, optional and extensible’.

The second workshop, held at the University of Warwick in April 1996, was organised by OCLC and UKOLN. This workshop aimed to broaden the consensus and promote interoperability across languages and subject disciplines, by acknowledging that the Dublin Core would not meet the diverse resource description needs of all communities, nor should it attempt to. The result was the Warwick Framework, a conceptual container architecture for holding multiple diverse ‘packets’ of metadata. Proposals for implementing this framework have been based both on an SGML DTD and the MIME architecture.

The third workshop in the series was held in Dublin, Ohio in September 1996. Organised by CNI and OCLC, it aimed to establish a core element set specifically for describing digital images, to ascertain whether the Dublin Core could be extended for this purpose. After much debate, it was agreed that most images were, in fact, Document-Like Objects, and therefore the Dublin Core Metadata Element Set could, with a few modifications, be usefully applied. After an intense period of debate on the Meta2 mailing list, a consensus emerged, along with some refinements to the element definitions and two additional elements (bringing the total to 15).

The fourth Dublin Core workshop, organised by OCLC, DSTC and the National Library of Australia, was held in Canberra, Australia in March 1997, where participants succeeded in arriving at a broad consensus on the need for an optional method for refining the meaning of the 15 Dublin Core elements through the application of three additional qualifiers—dubbed the Canberra Qualifiers—TYPE, SCHEME and LANG. These can be used to refine (as distinct from extend) the semantics of the 15 individual elements. Progress was also made in the application of the COVERAGE element.

Although simplicity is one of the fundamental tenets of the Dublin Core metadata initiative, there is a significant body of opinion within the stakeholder community that the ability to refine the semantics of the element set, through the addition of optional qualifiers, is essential to allow effective deployment of the Dublin Core for resource discovery.

For example, the value 759.2 (British Painting) in the SUBJECT element is effectively meaningless unless it is qualified as referring to the Dewey Decimal Classification system. Similarly, the elements DATE and COVERAGE have little meaning or value without qualification of some form; for example, does the date 07-03-97 refer to the seventh of March or the third of July, and which century is it in?

To address these shortcomings, the qualifiers TYPE, SCHEME and LANG have been agreed upon:

bullet

TYPE allows the refinement and clarification of element content such as, for example, dividing DATE into Creation Date, Date Last Modified, etc. It can also be used to identify logically grouped items of information.

bullet SCHEME refers to an existing classification system, coding scheme, glossary or thesaurus, and is used to identify the meaning of an element's content.
bullet LANG refers to the language in which the contents of the metadata is written.

Interoperability can be preserved by adopting a consistent syntax for encoding the qualifiers, in order that simple systems can simply discard the qualifiers and their values, and parse the metadata at the level of the 15 unqualified Dublin Core elements.

A separate working group has been set up to investigate registry issues for the standardisation of qualifier values, such as the logging of ‘acceptable’ coding schemes and their recognised abbreviations.

The formal acknowledgement of the requirement for qualifiers in Canberra paves the way for practical deployment of the Dublin Core.

The ability to embed structured descriptive information in WWW pages has always been recognised as a key strategic application for the Dublin Core: The following sections outline the syntax for embedding Dublin Core metadata without qualifiers in HTML, some of the reasons behind the development of the Canberra Qualifiers, and two recommended syntaxes for embedding Dublin Core metadata with Canberra Qualifiers in HTML.

The simple syntax in which Dublin Core (without the optional Canberra Qualifiers) is embedded in HTML relies upon the HTML <META> tag and takes the form:

<META NAME = DC.ElementName CONTENT = "value string">

For example:

<META NAME = DC.creator CONTENT = "William Shakespeare">

The <LINK> tag can be used to refer back to a registry of element definitions held at OCLC:

<LINK REL="SCHEMA.dc" HREF="http://purl.org/metadata/dublin_core_elements#subject">

The extended syntax described below is the first of two recommendations from the Dublin Core group, and is considered by the group to be sub-optimal; However, the preferred syntax requires changes to the HTML DTD that have not been agreed by the W3C at the time of writing. The syntax presented here conforms strictly to the HTML DTD from version 2.0 onward.

The qualifier is appended to the element name using a full stop (period or dot) as a separator:

<META NAME = DC.ElementName.TypeQualifier CONTENT = "(LANG = Language) (SCHEME = SchemeName) element value">

For example, using just the SCHEME qualifier:

<META NAME = DC.subject CONTENT = "(SCHEME=DDC) 759.2">

Using both TYPE, SCHEME and LANG:

<META NAME = DC.date.created CONTENT = "(SCHEME=ISO 639) (LANG = en) 1997-03-07">

The Dublin Core homepage is at http://dublincore.org/

IAFA Templates

Also known as ROADS Templates or WHOIS++ Templates

The IAFA templates were originally devised by the Internet Anonymous FTP Archives working party of the Internet Engineering Task Force, as a simple means to catalogue the contents of FTP archives. The records produced using the templates are stored as simple attribute-value pairs (i.e. the name of the attribute or field, followed by a separator and the field value), usually as flat ASCII text files.

There are a number of different types of IAFA Templates, used for describing different types of resources; most of the templates are used for describing individual files (documents, images, software package etc.), and use the same basic set of attribute (or field) values. There are also a number of templates for describing other types of resources, such as services, logical archives and mirrors.

Some of the attributes (fields) within each template can be repeated; these are known as variants. It is also possible to create clusters of attributes to describe USERs (i.e. people) and ORGANIZATIONs. These clusters can then be combined with a role prefix and embedded within other template types.

The attributes (fields) used in templates that describe individual files (e.g. DOCUMENT, IMAGE, VIDEO, FAQ etc.) are shown below:

Attribute Name:

Description.

Handle:

Handle of object.

Category:

Type of object. See Note <1>

Title:

Complete title of the object.

URI-v*:

Description of access to object.

Short-Title:

Summary title (if the Title is very long).

Alternative-Title:

An alternative to the Title or Short-Title title fields.

Author-(USER*):

Description/contact information about the authors/creators of the object.

Admin-(USER*):

Description/contact information about the administrators/maintainers of the object.

Source:

Information as to the definitive version of the object.

Requirements:

Any requirements for the use of the object. A free text description of any hardware/software requirements necessary to use the object.

Description:

Description (that is, "abstract" in the case of documents) of the object.

Citation:

The preferred method of referring to this object when used in other works.

Publication-Status:

Current publication status of object (draft, published etc.).

Publisher-(ORGANIZATION*):

Description/contact information about object publisher.

Copyright:

The copyright statement. Any additional information on the copying policy may be included.

Creation-Date:

The creation date for the object.

Discussion:

Free text description of possible discussion forums (USENET groups, mailing lists) appropriate for this object.

Keywords:

Appropriate keywords for this object.

Version-v*:

A version designator for the object.

Format-v*:

Formats in which the object is available. (See Note <2>)

Size-v*:

Length of object in bytes (octets).

Language-v*:

The name of the language in which the object is written. For documents this would be the natural language. For software this would be the programming language.

Character-Set-v*:

The character set of the object. This should be a well-known value for example "ASCII" or "ISO Latin-1".

ISBN-v*:

The International Standard Book Number of the object.

ISSN-v*:

The International Standard Serial Number of the object.

Last-Revision-Date-v*:

Last date that the object was revised.

Subject-Descriptor-Scheme-v*:

Name of classification scheme used in the corresponding Subject-Descriptor field.

Subject-Descriptor-v*:

A classification mark for the resource.

To-Be-Reviewed-Date:

Date on which the resource is expected to require reassessment.

Record-Last-Verified-Email:

E-mail address of the person who last verified this template.

Record-Last-Verified-Date:

Date of the last verification of this template.

Comments:

Comments by the template creator(s).

Destination:

Records which database(s) a template will go into once evaluation procedures have been completed.

<1> The intention of this field is to define the category of the object. For example, in the case of documents it could be "Technical Report", or "Conference Paper" and the name and date of the conference at which the paper was presented. It may also be something like "General Guide" or "User manual".

<2> Objects are often available in several formats. For example, documents may be in PostScript, ASCII text, DVI etc. For images this may be GIF, JPEG, TIFF etc. Format should be specified in MIME type syntax and semantics where possible (See RFC1521).

The following table shows the attributes in the USER cluster:

Attribute Name:

Description

Name:

Name of individual.

Work-Phone:

Work telephone number of individual.

Work-Fax:

FAX (facsimile) telephone number of individual.

Work-Postal:

Postal address of individual.

Job-Title:

Job title of individual (if appropriate).

Department:

Department to which individual belongs.

E-mail:

Electronic mail address of individual.

Handle:

Unique identifier for this record.

Home-Phone:

Home telephone number of individual.

Home-Postal:

Home postal address of individual.

Home-Fax:

FAX (facsimile) telephone number of individual.

The following table shows the attributes in the ORGANISATION cluster:

Attribute Name:

Description.

Name:

Name of organization.

Type:

Type of organization (University, commercial organization etc.)

Postal:

Postal address of organization.

City:

City of organization.

State:

State, province or county of organization.

Country:

Country of organization.

E-mail:

Electronic mail address of organization.

Phone:

Phone number of organization.

Fax:

Fax number of organization.

Handle:

Handle of organization.

Early implementations of the IAFA Templates for their intended purpose, the description of FTP archives, include Nexor’s ALIWEB search service and the Linux Software Map.

They are also employed by a number of other initiatives, most notably the ROADS software (Resource Organisation And Discovery System) used by a number of the Access to Network Resources projects of the UK’s Electronic Libraries Programme, for example ADAM (Art, Design, Architecture and Media Information Gateway, http://adam.ac.uk/), OMNI (Organising Medical Networked Information, http://omni.ac.uk/) and SOSIG (Social Science Information Gateway, http://sosig.ac.uk).

Categories for the Description of Works of Art

This section is based on the AITF web pages available at http://www.getty.edu/research/institute/standards/cdwa/index.html

Produced by the Art Information Task Force (AITF), the Categories for the Description of Works of Art are guidelines for formulating the content of art databases. They articulate an intellectual structure for descriptions of objects and images: in this sense they constitute a schematic representation of the requirements and assumptions implicit in the practice of the discipline of art history.

By providing a single, encompassing framework for descriptive information about works of art, the Categories are intended to enhance compatibility between diverse systems that wish to share art information. Such a standard will contribute to the integrity and longevity of information transmitted across networks and moved to new systems. Above all, it will give researchers consistent, reliable access to information stored in a variety of systems in far-flung places.

The framework defined by the Categories represents information that is suitable for sharing and exchange. The context for such an exchange might include preparing a catalogue for an exhibition, doing research for a scholarly essay, or locating works of art to see on a study trip. While much of the information defined by the Categories may also be found in local museum collections management systems, the task force decided that the intricacies of collections management such as object tracking are best defined by local museum professionals.

The AITF Categories for the Description of Works of Art focus upon "movable" objects and their images, including paintings, works on paper, sculpture, ceramics, metalwork, furniture, design, performance art, and so on, from all periods and all geographic regions. Although the Categories do not deal with the built environment, the depth and richness of the information the Categories articulate may serve as a model for descriptive standards in a range of fields in the humanities, including architecture.

Through its survey and review process, the task force identified 26 distinct kinds of information found in descriptions of works of art. While art historians may not automatically think of content in terms of categories of information, categorisation is in fact implicit in much of their work. The traditional catalogue entry reflects discrete, recognisable groups of information usually distinguished typographically for easy access by the reader. In a typical exhibition catalogue the artist, title, date, owner, provenance, measurements, materials and techniques, and current location are clearly separate groups of information. The AITF simply extends that process to the full range of information found in descriptions of works of art.

To make this categorisation neutral and applicable to many areas of study, the AITF has deliberately chosen broad category names. Thus, for example, the Creation category provides for both known and unknown individual artists, as practised in the study of European art, and for tribal and cultural group identification as used in African or Oceanic studies. The Creation category also accommodates all the other circumstances surrounding the creation of a work of art such as date, commission, and place.

Categories indicated as contain the minimal information that is needed to uniquely identify a work of art; these constitute "primary access points" in an automated environment. Of course this is not meant to imply that only the minimal amount of information recommended should be recorded about any given work of art.

Each category, in turn, may contain discrete items of information that a scholar expects to find about works of art. The sub-categories within each category identify and define those specific bits of information, grouping them together in a logical way. The sub-categories essentially describe the specific items of information that would be included in either a narrative or fielded description dealing with the category. As on the category level, sub-categories indicated as constitute primary access points for the minimal information needed to uniquely identify a work of art.

Object/Work

Creation

Classification

Ownership/Collecting History

Orientation/Arrangement

Copyright/Restrictions

Titles Or Names

Styles/Periods/Groups/Movements

State

Subject Matter

Edition

Context

Measurements

Exhibition/Loan History

Materials and Techniques

Related Works

Facture

Related Visual Documentation

Physical Description

Related Textual References

Inscriptions/Marks

Critical Responses

Condition/Examination History

Cataloguing History

Conservation/Treatment History

Current Location

Van Eyck (Visual Arts Network for the Exchange of Cultural Knowledge)

The project was driven by its art historical content rather than by the development of technology: the definition of a core record was at its heart. It is intended to provide a definition of the minimum fields needed to retrieve information about art images. It was arrived at by analysing the structure of several existing art image databases including those of the Project partners and identifying the commonly occurring fields. The process of identifying fields was accompanied by rigorous research in order to define data validation and content. Since its inception it has become a standard for those setting up art image databases. The Core Record underwent constant revisions during the course of the Project as the result of user trials and internal discussions. It was finally completed in 1996. The fields selected were divided into mandatory and non-mandatory categories for the purpose of implementation in the final system; the mandatory fields are shown below:

Artist Record
Object Record
Image Record
Artist Name
Title
Image Filename
Place of Birth
Title Keyword
Image ID
Gender
Earliest Object Date
Object ID
Birth Date Start
Latest Object Date
Morelli Fields*
Birth Date End
Owner Location Name
Source of Image
Place of Birth
Owner Location
Copyright
Death Date Start
Dimensions and Unit
 
Death Date End
Type of Object
 
Nationality
Object ID
 
Artist ID
Artist ID
 

*the inclusion of a visual matching software within the Van Eyck system which allows the user to search on the visual characteristics of an image as well as its text record

It was decided that a minimum record for art image data should be divided into three entities. The Artist or originator of the original work which is defined as the Object and the Image or images derived from it—which exist as a reproduction in a given archive. This lead to the definition of the following relational structure consisting of three tables with one to many relationships.

N.B. For copies of detailed Project reports on the Core Record please apply to Catherine Gordon, Witt Library, Courtauld Institute, Strand, London.

Visual Resources Association (VRA) Core Record

This section is based on the VRA web pages available at http://www.oberlin.edu/~art/vra/vra.html The VRA core group of data categories has been developed for describing surrogate images of art and architecture in visual resources collections and for sharing that information electronically. The core is a level between minimal and full and suggests which data elements are required in order to describe an item in a visual resources collection in a shared environment. The core level of description may be distinguished, on one hand, from a minimal level, which would specify the fewest elements required to uniquely identify, locate, and account for an item in a visual resources collection, and, on the other hand, from a full or exhaustive level of description. The core level is more desirable; and when possible, the inclusion of categories beyond the core should be considered. Thus, core categories may serve as a template for selecting, from the total amount of documentation available, the information visual resources collections would share. The Data Standards Committee (DSC) excluded many categories of information from this core. Not listed are such object categories as ownership/collecting history and related works as well as a whole host of collection management categories related to label production and slide circulation. The committee has tried to limit its efforts in order to present something that is both coherent and useful. We hope, therefore, that visual resources professionals, software vendors, and others working on data standards will consider this initial publication as a draft of a future, and much larger, publication on data standards for visual resource collections. The core categories are mapped to Dublin Core, the Getty Art History Information Program (now Getty Information Institute) Categories for the Description of Works of Art (see above). The DSC found some shortcomings in the CDWA with regard to surrogate images and categories for describing architecture and other site-specific works of art. Consequently there was an adaptation or redefinition of some of the CDWA categories to reflect the point of view of the visual resources community. Each category is also mapped to MARC in recognition of the importance of MARC as a potential standard for visual resources collections, particularly those in libraries and archives. This should be helpful to collections considering MARC applications.

VRA Record Core Categories 3.0
Record Type
Type
Title
Measurements
Material
Technique
Creator
Date
Location
ID Number
Style/Period
Culture
Subject
Relation
Description
Source
Rights

A full description can be found at http://www.gsd.harvard.edu/~staffaw3/vra/vracore3.htm

CIDOC Standards

The Documentation Committee of the International Council of Museums (CIDOC) comprises a number of working groups whose work has resulted in standards publication. There are parallels to be drawn between such work and that of the AHDS in that there is, at one and the same time, domain specific initiatives (e.g. archaeology, contemporary art) within a generic framework (e.g. the CIDOC Data Model).

bullet The CIDOC Archaeological Sites Working Group published a Draft International Core Data Standard for Archaeological Sites and Monuments in June 1995.
bullet The CIDOC Ethnography Working Group published the International Core Data Standard for Ethnology/Ethnography in September 1996. This includes the results of a survey of ethnographic documentation practice and a core data standard for ethnographic collections.
bullet The International Guidelines for Museum Object Information were published by CIDOC in June 1995.
bullet The CIDOC Multimedia Working Group published Introduction to Multimedia in Museums, November 1996.
bullet The CIDOC Data Model is available as an Entity-Relationship model. The intention is to produce future CIDOC data models using O-O methods, techniques, and formats. While this effort is under way the current CIDOC Relational Data Model will be available, although it will not be modified or enhanced. The scope of the current relational model concentrates on the documentation of museum objects and their provenance; the O-O model will enable more specific collections management and research information to be included.

In addition the AFRICOM Inventory Working Group has completed work on a handbook of standards, the aim of which is to establish common documentation standards for use by museums throughout Africa to facilitate the development and exchange of information.

A full description can be found at http://www.cidoc.icom.org/guide/guide.htm

SPECTRUM: the UK Museum Documentation Standard

As a procedural standard detailing the data needed for 20 collections management procedures, SPECTRUM was a world first. A ‘cut-down’ version, SPECTRUM Essentials, is available at the MDA web pages at http://www.mda.org.uk/

In the context of the Edinburgh Workshop it may be useful to look at the overview of the Units of Information provided by SPECTRUM, since this groups attributes together according to the type of information they provide:

Groupings for Units of Information from SPECTRUM
Record information
Movement information
Amendment history
Condition information
Object identification information
Conservation and treatment information
Object description information
Reproduction information
Object history and association information
Audit information
Object production information
Insurance information
Object collection Information
Indemnity information
References
Valuation information
Object entry information
Exhibition and display information
Loan in information
Loan out information
Acquisition information
Despatch information
Location information
Loss information
Object location information
Disposal information

MDA Data Standard

Revised in 1991 the standard was originally developed in the 1960’s, when it focused entirely on issues of cataloguing. It was extended in the 1980’s to include some aspects of collections management, and in its latest edition consists of approximately 160 fields, which are categorised as being either Primary, Group or Common

CIMI DTD

Thanks to Richard Light for this information

CIMI (the Consortium for the Computer Interchange of Museum Information) is a group of museums and museum bodies which have come together to further the cause of pooling and exchanging museum information. CIMI aims to make progress at both a theoretical and a practical level.

CIMI’s current major project is Project CHIO (Cultural Heritage Information Online). This project aims to deliver at least 10,000 records of objects and information about folk art as a searchable online resource. This will include the full text of exhibition catalogues, wall texts, etc. as well as images and the more traditional museum database records.

CHIO aims to set up a demonstrator which will support different ‘points of view’ relating to this information. The ‘general public’ PoV will reflect the knowledge and interest of a typical museum visitor. The ‘museum’ PoV will reflect the knowledge and interest of a museum professional.

CIMI decided at an early stage in Project CHIO to use the SGML standard (Standard Generalised Mark-up Language; International Standard ISO 8879:1986) to mark up the textual resources involved in the project (exhibition catalogues and wall texts). In order to do this, it had to adopt or develop a Document Type Definition (DTD), a set of rules that defines the allowed structure of SGML documents. SGML requires that every valid document should have an associated DTD, and since no DTD’s had been developed for museum information of any sort, it was decided that a new DTD was required.

What follows is a list of the ‘atomic’ access point concepts defined for the project’s SGML DTD:

Access point Description
award An award or prize.
bibliography A complete bibliographic item (series, book, journal), or a citation from within a bibliographic item.
concept An abstract concept, for example one presented in the interpretation of the subject matter of a work of art.
context Political, social, economic, or religious events or movements associated with a work of art at its creation and over time.
context-archaeological The circumstances in which a work of art was excavated or discovered.
context-architectural The relationship between a work of art and a particular environment, built work, or open space)
context-historical Political, social, economic or religious events or circumstances associated with the work of art over time.
copyright.restrictions An identification of the individual or group that holds the rights to use, exhibit, or reproduce a work of art, along with an indication of any existing restrictions on its reproduction, exhibition, or use.
creation The creation, design, execution, or production of a work of art and its components, including all those responsible for the creation of the work, the dates of that activity, and where the creation took place.
creator Information about an individual, a group of individuals, corporate body, or cultural group that contributed to the creation, production, manufacture, or alteration of the work.
credit-line A formal public statement about the ownership, transfer of ownership, acquisition, source, or sponsorship of the acquisition of a work, suitable for use in a display label or publication.
current-location Identification of the repository that currently houses a work of art, and its geographical location.
date-range A time span, normally represented by a start date and an end date. Date ranges can also be represented (less exactly) by descriptive terms. Either, or both, ends of the range may be estimated or unknown.
event An historical event or situation. Any past occurrence, whether or not it has a specific name.
identity Characteristics that uniquely identify an entity.
identity-number Any unique identifier assigned to an object by its owner or repository.
inscription.mark A description of distinguishing or identifying physical markings, lettering, annotations, texts, or labels that are a part of a work of art or are affixed, applied, stamped, written, inscribed, or attached to the work, excluding any mark or text inherent in materials.
materials-and-techniques The substances or materials used in the creation of a work of art, as well as any production or manufacturing techniques, processes, or methods incorporated in its fabrication. This includes a description of both the materials used to create the work and the way in which they were put together.
material An identification of the materials used to create the work of art, along with an indication of where they were employed.
nationality.culture.race The national, cultural, or ethnic origins of a person.
object.work An art work, or any inanimate object.
occupation A major professional role played by an individual throughout his/her lifetime, or the major roles that define the activities or purpose of an organization.
organisation A named entity involving a number of people.
ownership The provenance or history of the ownership of an object from its creation to the present. This includes the means by which it passed from one owner to the next, an identification of any public sales involving the work or the names of any agents who aided the transfer of ownership, and the names of any dealers who handled the work or included it in their inventories. If a work has been lost, stolen, or destroyed, or has otherwise vanished from public view, this fact should also be indicated here.
person An individual. This includes anonymous individuals: useful information can still be recorded even if a person's name is unknown.
place A geographical or geopolitical entity. This includes imprecisely defined places.
process.technique The means, method, process, or technique by which a material was used in the creation of a work.
quote A piece of text taken from an external written or verbal source. Quotes will often have an associated person or bibliography.
styles.movements A style, historical period, school, or art movement.
subject The subject matter of a work of art (sometimes referred to as its content) is the narrative, iconic, or non-objective meaning conveyed by an abstract or figurative composition. It is what is depicted in and by a work of art.
subject-description A description of a work of art in terms of the generic elements of the image or images depicted in, on, or by it.
subject-identification The name of the subject depicted in or on a work of art: its iconography. Iconography is the named mythological, fictional, religious, or historical narrative subject matter of a work of art, or its non-narrative content in the form of persons, places, or things.
subject-interpretation The meaning or theme represented by the subject matter or iconography of a work of art.
title.name The title[s] or name[s] given to an entity. These include the titles of artworks.
type.classification A categorisation, either formal or informal, of an entity. Formal classifications will be taken from a publicly known system.

SCRAN Data Standard

The SCRAN (Scottish Cultural Resources Access Network) Data Standard has the following fields which have been mapped to Dublin Core and SPECTRUM:

Fields in SCRAN Data Standard
Identifier
Date
Institution code
Date type
Identity number
Event
Simple name
Event type
Simple name type
Associated resource ref
Description
Associated resource type
Person name
Original record pointer
Person name type
Author
Place name
Date last modified
Place name type
Usage
Grid Reference
Notes

Object ID

To be launched in Amsterdam in May 1997, Object ID is a core standard for the identification of art, antiques and antiquities. In the event of theft, Object ID is intended to allow the quick transmission of information about stolen objects among museums, police and customs agencies, the art trade and collectors.

Machine Readable Cataloguing (MARC)

MARC (ISO 2709/ANSI Z39.2) was first deployed in 1968, and is undoubtedly the oldest metadata format in existence. Originally developed as a standard data record structure detailing every aspect in the storage of bibliographic data (down to the specification of the physical tolerances of the magnetic tape it is stored on!), it has been modified over the years to provide a rich and flexible structure for the storage of a wide range of information types, including the description of digital resources (the 856 fields).

The key benefits of MARC are derived from the fact that it is the standard bibliographic format for libraries around the world. However, relatively uncoordinated development of the standard over the years has resulted in a situation where there are now many different ‘flavours’ of MARC in use around the globe. Work is currently underway to harmonise these different versions as the UNIMARC standard, which is expected to be completed in 1998.

MARC has been used for storing internet metadata information in the INTERCAT project http://www.oclc.org/news/oclc/cataloguing/type.htm, and is the basis of the Z39.50 Profile for a Digital Library http://www.loc.gov/z3950/agency/ An SGML DTD implementation has been produced, and can be found at http://www.loc.gov/marc/marcdtd/marcdtdback.html.

ICONCLASS

A hierarchical notation-based classification system for the indexing and organisation of the subject matter of art works and their surrogate images. Developed in The Netherlands by H. van de Waal but published in English. ICONCLASS has been deployed in many European and American museums, slide libraries, photographic archives and documentation centres; for example, it is used extensively in the Witt Index at the Courtauld Institute of Art.

Terminology Resources

For a comprehensive index of resources see WordHoard on the MDA web pages at http://www.mda.org.uk/wrdhrd1.htm

The Getty Information Institute’s Vocabulary Program constitutes of the Art and Architecture Thesaurus (AAT), the Union List of Artist Names (ULAN) and the Thesaurus of Geographic Names (TGN). The AAT is available on-line via subscription to the Canadian Heritage Information Network (CHIN).

Recent work has focused on multilinguality. CHIN is undertaking French equivalency work of the AAT, whilst the Escuela Tecnica Superior de Arquitectura, Universidad Politecnica de Madrid is doing the same in Spanish. A project to translate ICONCLASS (see above) into Italian is being undertaken by the Istituto Centrale per il Catalogo e la Documentazione (ICCD); information can be found on the ICCD web site at http://www.iccd.beniculturali.it. Another project in progress is a Multilingual Thesaurus of Religious Objects based on Suppellettile eccelsiastic, published by ICCD and Religious Objects/Objets religieux, published by the Canada/France Accord. The International Terminology Working Group have published Guidelines for Forming Language Equivalents. The MDA is in the process of co-ordinating a project proposal to the EC for a feasibility project on multilingual terminology resources.

Back to top up arrow

 
home | search | explore | about us | contacts | fine art | depositing | guides | advice | research | learning | reports | site index | ©