Preprints of the
Metadiversity
Conference
Proceedings
Session 2: The Challenge in Species Discovery and
Taxonomic Information
Conventions, Standards, and Consensus in Systematic
Practice: How Far Can (or Should) We Go?
PETER F. STEVENS, Professor of Biology, University
of Missouri, St. Louis, and Curator, Missouri Botanical
Garden
|
ABSTRACT
Different kinds of
standards and conventions are first discussed. The
Taxonomic Databases Working Group (TDWG) both
endorses the use of particular standards and
conventions to help improve interoperability
between the various parts of systematics, and also
actively promotes their development. Plant taxa of
different levels of the hierarchy are in part
convention but are not standards. Judicious
application of both standards and convention can
reduce the labor of describing species. The extent
to which standards can be applied to reduce
disagreement about the limits of species is
discussed; it is argued that taxonomists cherish
their opinions in part because of our poor
understanding of this most basic of systematic
operations. At the level of plant names, the
International Plant Names Index (IPNI) aims to
become a metastandard by developing bibliographic
standards for plant names, initially focusing on a
database of well over 1 million names of flowering
plants. The database will be replicated,
distributed, self-archiving, and will reflect
corrections and additions (which can be made
directly by anybody in the systematic community) in
real time. A prototype module containing author
names and abbreviations has been developed. IPNI
will both use existing TDWG standards and develop
new ones. It is a nomenclatural database, making no
judgment on synonymy. |
"The nice thing about
standards is that you have so many to chose from;
furthermore, if you do not like any of them, you can just
wait for next year’s model" (Tanenbaum 1981, 168).
The goal of this talk is to
begin to explore the general issue of the role of standards,
conventions and metadata in systematic practice in general
and species and their description and delimitation in
particular. After a few necessary definitions, I will talk
very briefly about the activities of the Taxonomic Databases
Working Group (TDWG) in promoting standards and developing
metadata. I then will suggest that thinking about
conventions and standards helps illuminate the nature of
plant taxa, and in particular the constraints on how species
are delimited. I then will show how TDWG standards and
convention are integral to the developing International
Plant Names Index (IPNI) a database of plant names
(initially flowering plants only). IPNI, with its
bibliographic information on plant names, kept current and
enabling linkages to other kinds of data, provides metadata
useful in any biological project that uses the names of
flowering plants. The standards developed, and others, can
be used to simplify many aspects of systematic practice.
Although we should try to standardize much of the routine of
systematic work, the use of inappropriate standards can make
answering some kinds of systematic questions difficult.
Standards, Conventions,
Consensus and Metadata
First, some necessary
definitions. A standard, to quote the Oxford English
Dictionary is an "examplar of measure or weight"–hence "an
authoritative or recognized exemplar of correctness,
perfection, or some definite degree of any quantity." A
convention, on the other hand, is "a rule or practice based
on general consent." The word convention quite often has
negative connotations–it is artificial, arbitrary and
formal, while a standard refers to something in the real
world, or at least something we think is more important.
Consensus, "the collective unanimous opinion of a number of
persons," is one way in which standards and conventions
become effective. Standards in particular come in a variety
of flavors, and we can usefully distinguish here between de
facto, de jure, and formal standards (the last are standards
accepted by a standards body like the International
Standards Organisation (cf. Mowbray & Malveau 1997)).
Standards that deal with "correctness" can be called
reference standards; if they refer to rules or actions
necessitated by a body of systematic theory, they can be
called rule-based standards; and if they deal with "some
definite degree of any quantity," they can be called metric
standards. These distinctions are not exclusive, but they do
emphasize how heterogeneous "standards" are. Metadata are
data about data, or data that underpin the use of data in a
particular way; and standards and conventions are involved
in the development of metadata.
The Taxonomic Databases
Working Group (TDWG)
The International Working
Group on Taxonomic Databases was set up in 1985 to establish
international collaboration among database projects. Its
goal is to aid in the dissemination and exchange of
information by focusing on the common use and interpretation
of terminology, data fields, dictionaries, common logical
rules, and data relationships. Basically, it deals with
metadata and standards. Initially botanical in focus, it has
broadened out over the years, although there is a pressing
need to become yet more active.
The role of TDWG is that of a
facilitator, and it may evaluate and endorse existing
standards or get groups of people together to make new ones.
The kind of projects in which it has been involved include
defining standards for data collection in economic botany
(Cook 1995), geographic names (Hollis & Brummitt 1992),
abbreviations for authors (Brummitt & Powell 1992), while it
endorses such important standards as a format for recording
and exchanging descriptive taxonomic data, DELTA
(Descriptive Language for Taxonomy) (Dallwitz & Paine 1986),
and the herbaria acronyms used when specimens are cited (Holmgren
et al. 1990). Many such standards are in common use, greatly
facilitating communication among biologists.
Of course, new authors
describe new species for the first time, new herbaria are
formed, and so new editions of these standards are produced,
or, for the Web versions, they are kept current–and for our
future efforts, that is where the emphasis will be. Indeed,
with the decentralization now possible with such projects
(see below), we need not get into the situation of having an
out-of-date standard or a standard that is independently
improved by different groups–parallel evolution here, as
elsewhere, is the bane of systematics! As the need arises,
we can develop new–and extend the coverage of–existing
standards. Is there confusion in how botanists and
zoologists cite collections? What about author and journal
abbreviations and contractions across all groups, not just
plants? As lists of taxa (this refers to groups at any level
of the taxonomic hierarchy, from kingdom to variety) become
ever more comprehensive, these are issues that we may need
to consider. Codes of nomenclature are also standards and
metadata, and again, there is a need to establish whether
there is any call for a unified code of biological
nomenclature.
Conventions, Standards,
and Systematics
I begin by thinking of the
higher levels of the taxonomic hierarchy–classes, families,
genera, etc.–in the context of standards and conventions. If
taxa at the same rank, e.g., a genus, were equivalent, this
might provide some kind of metric for measuring diversity.
However, for most systematists higher taxa are simply groups
that contain one or usually more lower taxa. They may
delimit groups that a systematist for some reason or other
thinks are particularly interesting, but that does not
convert to any equivalence of groups at the same
hierarchical level (Stevens 1997a). There is widespread
acceptance of the principle of monophyly, and this means
that all higher taxa have to be monophyletic, i.e., contain
all and only the descendents of a particular common
ancestor, but this is a grouping, not a ranking criterion.
However, not only is there still no consensus here, but
monophyly is anyway a rule-based, not a metric, standard.
Given the constraints of monophyly, it is convention driven
by a need to communicate that leads us to give a named group
a particular circumscription; we apply names at ranks like
order, family, and genus to those parts of a taxonomy that
are commonly used in teaching and in general communication
between biologists (Angiosperm Phylogeny Group 1998: a
phylogenetic naming system will not escape this need to act
as a communication system), and so they are in the context a
kind of reference standard. But whatever one’s systematic
philosophy, there is no measurement unit inherent in a
biological classification. If I walk two miles, I have
unarguably walked twice as far as if I had walked one mile;
if my state has 500 genera of plants and yours has 1000, I
cannot say that your state is twice as diverse as mine.
Similarly, although it may be something of a convention to
use numbers of species in estimates of diversity and amount
of radiation (systematists have suggested ways of
quantifying these), they, too, are a poor metric for this.
What about conventions and
these species? Here I touch both on the act of writing
descriptions and on how we go about circumscribing
species–and I should say that nearly all the descriptive
systematic work I do is on tropical plants.
Writing Descriptions
Linnaeus used a brief, almost
telegraphic style for his descriptions, and he defined and
illustrated the terms he used (Linnaeus 1751). Over the
years Linnaeus’s definitions of shape terms and the like
were modified and alternative usages crept into use.
Definitions of some shape terms were reviewed by a committee
formed under the aegis of the Systematics Association and
their consensus published (Anonymous 1962) and adopted by
Stearn (1992) in his invaluable "Botanical Latin." These
have become reference standards, standards for
communication, to be used as a matter of good practice by
all systematists when describing plants. Unfortunately, they
neither map onto discontinuities in nature nor provide a
metric for shape. They are imprecise and even positively
misleading if it is assumed that two taxa described as
having, say, a differently-shaped lamina apex are thus
necessarily assignable to two discretely different shapes,
or if it assumed that two taxa with the same shape
necessarily do not have discretely different shapes.
That the definitions of these
shape terms, although accessible, are difficult to apply
without reference to diagrams–yet they generally are so
used–points to a problem with many reference standards.
Accessibility and ease of use are essential for their
adoption, for them to become standards de facto. This is
particularly true of color standards, such as the set
devised by Ridgway (1913) and published in a single edition
of 4,000 copies. Its problems were evident by the middle of
the century (Hanly 1949), but it is still in use, partly
because Ridgway focused on colors common in the living
world. But other color standards have inevitably (given the
printing history of Ridgway’s book, and the fact that the
colors there are not all stable) come into use, and these
include the Munsell and Royal Horticultural Society color
charts, that proposed for British fungi (Henderson et al.
1969), and so on. These later standards are more accessible,
although they may focus on different parts of the spectrum.
In any case, many biologists prefer to use terms like "red,"
"violet," and "lilac"–terms in common use but without
precise definitions.
Units for measuring size may
be arbitrary in the sense that there are all sorts of units
used by different cultures or groups. However, inches and
centimeters are interconvertible, and they provide a metric
by which measurements can be related to one another and
manipulated in a way that is impossible with many biological
descriptors, which are ultimately reference standards.
Biological descriptors refer to a variety of kinds of
objects, the extremes of which are terms like "ovule" and
"radius"–structures that are thought to have arisen only
once in the course of evolution–and "wing", "stipule", and "campylotropy"–for
which there is no such evidence. The former often have more
precise definitions and are commoner in vertebrate zoology
than in flowering plants (e.g., cf. Peters 1964; Wheeler et
al. 1993). Reference terms like "acute" are widely used,
although not necessarily with identical definitions.
Circumscribing Species
I find it unsettling that
many of the operations we carry out as we look at specimens
and estimate variation patterns–that is, much of systematic
practice–amount to a convention, and a largely unanalyzed
one at that (cf. Stuessy 1994, who stresses the development
of "skills of intuitive [sic] pattern recognition," albeit
in the context of well-studied temperate groups). The
operational species concept many of us use (by "us," I mean
systematists in general) is "a species is what a competent
systematist says it is" (Regan 1926). This concept–or
perhaps more accurately, convention–has been with us for
over 150 years (e.g., Stevens 1991, 1997b). In practice this
concept/convention has often been used to promote broadly
delimited species, with a reduction in the numbers of
species recognized being one of its useful results–a rather
strange argument, one would have thought. Species are also
explicitly based on discontinuities in variation pattern,
and there are conventions for how much and what kind of
variation is needed to indicate different levels in the
taxonomic hierarchy at and below the level of species (e.g.,
Davis & Heywood 1963; Mayr & Ashlock 1981). But there is
disagreement over such conventions, and subspecies and
varieties may be differently defined by systematists in
different countries (Hamilton & Reichard 1992), or the
significance of variation ornithologists had previously
called subspecific may be questioned (see Cracraft 1992, on
the numbers of species of birds of paradise). It is still
more unsettling to one who has long wished for closer links
between academia and the museum to observe the lack of
connection between those who theorize about the nature of
species, a very active area and largely in academia, and
those who describe species (McDade 1995), an area that is
somewhat in regress and is largely carried out in museums.
A monographer is
privileged–he or she can analyze material from throughout
the range of a group, judiciously evaluate the variation,
and make decisions about the limits of taxa with the "big
picture" in mind. But without clarification of criteria for
collecting and evaluating data on variation, this privilege
amounts to little. I have recently been looking at a widely
ranging species of Rinorea (Violaceae) growing from Myanmar
and Hainan to Borneo (Jarvie & Stevens 1998; cf. Jacobs &
Moore 1961). There is not one, but three species–one with
two subspecies, and another two species are still to be
described. Similarly, in a recent treatment of Fagraea (Gentianaceae),
42 species from Borneo are recognized, of which 24 are
endemic, while in an earlier treatment the corresponding
figures are 14 and 3 (Wong & Sugau 1996; cf. Leenhouts
1962). Importantly, in both cases material seen by the
earlier authors was assigned by the later authors to
different, and in Fagraea, often newly described species;
little that was completely new had turned up in the
intervening years. But if I ask the questions, Who is right?
and why are there differences? it is remarkably difficult to
provide answers. We are not sure of whether or not we are
delimiting species for a herbarium taxonomist, and we are
not sure of the significance of finding locally
discontinuous variation that appears to break down in a
geographically broader study (e.g., Stevens 1993; Cronk
1999). All too often we seem not to know why (for whom?) and
how we should analyze variation (critical comments made by
Heywood 1974, about taxonomy are still, unfortunately, all
too relevant).
The answers to such questions
go beyond the kind of convention and standard I have been
discussing, although they start there. We desperately need
to articulate what is involved in taxonomic practice if the
cherished independence of systematists–the wish each of us
has for our own opinions on the limits of species to be
taken seriously–is to amount to much. We must work towards
clarification of how we collect and analyze data,
articulating reference and rule-based standards where
necessary, and clarifying and justifying the role convention
plays in the whole systematic process.
The International Plant
Names Index
Although at the level of
species delimitation we are dealing both with unruly nature
and unruly naturalists, there are less problematical areas
where standards, conventions, and metadata can facilitate
the speed and ease with which we do our basic systematic
work. The Plant Names Project and its main product, the
International Plant Names Index (IPNI), serves to illustrate
this.
The names in IPNI will be
those currently in the Index Kewensis, based at the Royal
Botanic Gardens, Kew; the Gray Herbarium Index, at Harvard;
and the Australian Plant Names Index, at Canberra. IPNI will
initially include well over 1,000,000 names of flowering
plants, and we hope to extend the coverage to all vascular
plants in the current phase of the project. In short, it
aims to be an authoritative list of all plant names and
their place of publication, kept current, freely available
and query-able, decentralized, and maintained with a minimum
of bureaucracy. It will of course take some time to meet all
these ambitious goals. (Further details of the project may
be found in Croft et al. (submitted)
http://pnp.huh.harvard.edu.)
The names will reside in a
distributed database, and although only the three
institutions just mentioned are involved at present, IPNI is
designed to be highly scalable. All new names appearing
(including citations of types specimens–types are themselves
standards that allow us to relate names to plants with the
minimum of ambiguity) will be added from Kew, Harvard, or
Canberra and will be accessible as soon as they are added.
TDWG standards for geography, abbreviations, and
contractions of names and journals, etc., will be followed;
these standards, too, will be kept current and generally
accessible.
Of considerable importance is
the submissions module that has been developed. There are
four aspects of the submissions mechanism that I want to
mention here. First, although these changes will appear in
IPNI as soon as they are made, it will be clear to all users
that they have not been checked. Editors, whether of
particular taxa, particular types of names (e.g., names
described by Linnaeus), or of particular directories (e.g.,
periodical lists), will check the changes. Second, any and
all changes suggested will become part of the permanent
contribution history of that record, and this history, too,
can be accessed by users. Third, by making it easy for the
whole community to submit additions and corrections, the
difficult process of editing and verifying the existing
entries will be greatly facilitated. Fourth, the submission
module itself can be adapted for a variety of uses, so it,
too, can become a kind of standard. For instance, it could
serve as the mechanism by which names proposed for a
phylogenetic naming system are recorded.
IPNI is a nomenclatural list,
including the original citation of a name and, where
possible, that of its type specimen. Any name referring to
vascular plants should be traceable in it, and for most
purposes the information it contains will function as
metadata, serving to ground names in the literature. To IPNI
can be linked images of the protolog, including any
illustrations, images of the type, etc. It can be mirrored
to institutions and projects, and all projects linked with
it will be synchronized with it and kept up-to-date with the
information it contains, while corrections to it made in the
course of work on any of these projects can be made
immediately accessible to all.
Conclusions
Much of the work of
describing species and writing floras and checklists is
almost formulaic. Reference standards and conventions for
journal abbreviations, citation of type and other specimens
and of synonymy, etc., are very desirable, and their use
should be promoted by groups like TDWG, professional
societies, etc. They certainly should become de facto
standards, if not formal standards, and failure to use them
leads to the waste of an inordinate amount of time by
systematics. Reference standards may be arbitrary. There may
be more than one way of handling journal and book
abbreviations and contractions, but so long as the way
adopted works, its principles are explicit, and the
consequences of its adoption are understood. That is all
that matters. Until fairly recently it was a rite of passage
for all new flora projects to argue interminably about how
names should be handled–should the Australian botanist
Ferdinand von Mueller be F. von Muell., F. Mueller, F. v.
M., or something else? With a list of author names and
abbreviations, this need no longer happen; we can use either
a full name or a single contraction. The fewer standards
covering such aspects of a species description, the more our
work will be speeded up; this is not an area where having
several standards is a good thing (cf. the epigraph).
Simplification and standardization not only speeds up our
work, but it makes it much easier for a student to
understand what we are doing. But there are circumstances in
which the use of a particular standard may be inappropriate.
For writing a flora, the use of plain language and of
reference standards such as those in Stearn (1992) is
essential, but metric standards are needed for actually
deciding on the limits and relationships of species. Here
more than one standard is involved. (Note that metric
standards can be converted accurately to reference
standards, but the reverse operation may well introduce
imprecision (see also Heywood 1984).)
I have barely begun to
explore the role and interaction of standards, conventions,
and metadata in systematics. However, thinking of such
issues in the context of systematic practice will help us to
understand better what we do, to gather appropriate data
more quickly, and will allow meaningful consensus over the
limits of the taxa we use to describe the living world–and
this makes the important link to projects like ITIS, BONAP,
and Species 2000.
Acknowledgements
I thank W. L. Alverson, D. E.
Boufford, J. Cadle, N. Cross and F. Pando de la Hoz for
helpful discussion. The support of the NSF and USGS (NSF
DEB-9726045 and DBI-9808220) is gratefully acknowledged.
References
Angiosperm Phylogeny Group.
1998. An ordinal classification for the families of
flowering plants. Ann. Missouri Bot. Gard. 85:531-553.
Anonymous. 1962. Systematics
Association committee for descriptive biological terminology
II. Terminology of simple symmetrical plane shapes (Chart
1). Taxon 9:145-156, 245-247.
Brummitt, R. K., and C. E.
Powell. 1992. Authors of plant names. Royal Botanic Gardens,
Kew.
Cook, F. E. M. 1995. Economic
botany data collection standards. Royal Botanic Gardens,
Kew.
Cracraft, J. 1992. The
species of the birds-of-paradise (Paradisaeidae): Applying
the phylogenetic species concept to a complex pattern of
differentiation. Cladistics 8:143.
Croft, J., N. Cross, E. Nic
Lughadha, P. F. Stevens, J. G. West, and G. Whitbread. Plant
names for the 21st century: The International Plant Names
Index, a distributeddata source of general accessibility.
Taxon, accepted.
Cronk, Q. C. B. 1998. The
ochlospecies concept. Ch. 11 in: Huxley, C.R., J. M. Lock,
and D. F. Cutler (eds), Chorology, Taxonomy and Ecology of
the African and Madagascan Floras. Kew Bulletin Additional
Series, H.M.S.O., London.
Dallwitz, M. J., and T. A.
Paine. 1986. User’s guide to the DELTA system. Ed. 4.
Division of Entomology, Commonwealth Scientific and
Industrial Research Organisation, Australia.
Davis, P. H., and V. A.
Heywood. 1963. Principles of angiosperm taxonomy. Oliver &
Boyd, Edinburgh.
Hamilton, C. W., and S. H.
Reichard. 1992. Current practice in the use of subspecies,
variety and form in the classification of wild plants. Taxon
41:485-498.
Hamly, D. H. 1949. Robert
Ridgway’s color standards. Science 109:605-608.
Henderson, D. M., P. O.
Orton, and R. Watling. 1969. British fungus flora. Vol. 1.
H.M.S.O., Edinburgh.
Heywood, V. H. 1974.
Systematics–the stone of Sisyphus. Bot. J. Linn. Soc.
6:169-178.
Heywood, V. H. 1984.
Electronic data processing in taxonomy and systematics.
pp.1-15 in Allkin, R. and F. A. Bisby (eds), Databases in
systematics. Academic Press, London.
Hollis, S. and R. K. Brummitt.
1992. World geographic scheme for recording plant
distributions. Hunt Institute for Botanical Documentation,
Carnegie Mellon University.
Holmgren, P. K., N. H.
Holmgren, and L. C. Barnett. 1990. Index herbariorum. Part
1. The herbaria of the world. Ed. 8. New York Botanical
Gardens, New York. Jacobs, M., and D. M. Moore. 1971.
Violaceae. pp. 179-212 in van Steenis, C. G. G. J. (ed.),
Flora malesiana. Series 1, Vol. 7. Wolters-Noordhoff,
Groningen.
Jarvie, J. K., and P. F.
Stevens. 1998. New species and notes on Violaceae and
Flacourtiaceae from Indo-Malasia. Harvard Pap. Bot.
3:255-264.
Leenhouts, P. W. 1962.
Loganiaceae. pp. 293-387 in van Steenis, C. G. G. J. (ed.),
Flora malesiana. Series 1, Vol. 6. Wolters-Noordhoff,
Groningen. Linnaeus, C. 1751. Philosophia botanica...
Kiesewetter, Stockholm.
Mayr, E., and P. D. Ashlock.
1981. Principles of systematic zoology. Ed. 2.
McGraw Hill, New York.
McDade, L. A. 1995. Species
concepts and problems in practice: Insights from botanical
monographs. Syst. Bot. 20:606-622.
Mowbray, T. J., and R. C.
Malveau. 1997. Corba design standards. John Wiley, New York.
Peters, J. A. 1964.
Dictionary of herpetology. Hafner, New York.
Regan, C. T. 1926. Organic
evolution. Rep. British Assoc. Adv. Sci. 1925, 75-86.
Ridgway, R. 1912 (1913).
Color standards and color nomenclature. The Author,
Washington, D.C.
Stearn, W. T. 1992. Botanical
Latin. Ed. 4. David and Charles, Newton Abbott.
Stevens, P. F. 1990. Species:
Historical perspectives. pp. 302-311 in Keller, E. F., and
E. Lloyd (eds), Keywords in Evolutionary Biology. Harvard
University Press, Cambridge, Mass.
Stevens, P. F. 1997a. How to
interpret botanical classifications–suggestions from
history. BioScience 47:243-250.
Stevens, P. F. 1997b. J. D.
Hooker, George Bentham, Asa Gray and Ferdinand Mueller on
species limits in theory and practice: A
mid-nineteenth-century debate and its repercussions. Hist.
Records Austral. Sci. 11:345-370.
Stuessy, T. F. 1994. Case
studies in plant taxonomy: Exercises in applied pattern
recognition. Columbia University Press, New York.
Tanenbaum, A. S. 1981.
Computer networks. Prentice Hall, Englewood Cliffs, New
Jersey.
Wong, K. M., and J. B. Sugau.
1996. A revision of Fagraea (Loganiaceae) in Borneo, with
notes on related Malesian species and 21 new species.
Sandakania 8:1-93.
Previous |
Next
Questions:
Email us or Call (215)
893-1561
Copyright © 2003 NFAIS. All rights
reserved. No part of this product or service may be
reproduced, stored in a retrieval system or transmitted in any
form or by any means, electronic, mechanical, photocopying,
recording or otherwise, without prior written consent.
Privacy
Policy |