Databasing botanical collections was raised as a major issue at the first meeting of the Botanical Collections Managers Group, and it wasproposed that a questionnaire should be circulated to establish whatis already happening and where the BCMG could help. Thequestionnaire was circulated in Spring 2003.
Thank you all for your responses which I have tried to summarise below. It is clear that documentation takes place in different institutions according to their priorities and collections. Obviously each are free to make such decisions, but there are several key areas where BCMG might be involved:
Software/websites etc. Especially for the smaller herbaria, much of the software etc needed for databasing and disseminating information is already available and does not need to be developed anew. The BCMG can facilitate communication for those wanting to start documentation with those who have already done similar projects.
There is clearly a huge task of documenting all our collections across the range (only 7% done to date) for which funding is required; I am not sure to the extent that raising this funding is the role of the BCMG, but it is clearly an issue.
Advice on documentation is both needed and available. Can we consider running workshops, eg on reading labels, interpreting data etc. ?
If we are going to develop one central portal for digital herbarium information, it could be practical from existing databases which have written data standards and export into Excel. It would be simpler in the long run if new projects adopted simple data standards directly compatible with existing datasets. Most databases already contain a lot of valuable information fields such as locality, collector, dates etc. and do not need to be adapted.
Perhaps a good project to start with might be UK or Irish BAP species as many are already documenting them anyway and the data have obvious immediate application (and attractive for possible funding…). This would also go towards the Global Plant Conservation Strategy Target 16 Establishing and strengthening plant conservation communicationnetworks.
As identifications are not consistently checked prior to documentation, this has huge implications for the quality of the information on the web. One advantage of uncertainty in databased records is that the users should go back to the original source to check them before using them (generates enquiries, corrects identifications etc), though I suspect this very rarely happens and the data will be used and abused anyway.
Summary of responses
Herbariaresponding: BIRM, BON, CGE, DBN, E, GLAM, GGO, GL, IMI, K, LDS,LIV, NMW (vascular), NNHS, Oldham, OXF/FHO/FHOw, WSY (17). (SKarley personal herbarium excluded)
3 | Type of collections to be databased | It was clear that the whole range of collections were being databased across the herbaria, obviously adapted to which collections held (e.g. IMI only has cryptogams so is only documenting them). Some herbaria were only planning to document some of their collections, others all (including living and DNA collections). |
4 | Geographical Coverage | Most herbaria were documenting on a world basis reflecting scope of the collections, some on a British Isles basis only. |
5 | Approximate number of specimens to be databased/number already databased. | The total number of herbarium specimens in the collections reported was 13.1 million (dominated by 8 million from Kew!), of which 870,000 had already been documented to one degree or another (7%). On average 40% of the collection in each institution had already been documented, though there is a very clear and understandable inverse exponential relationship between size of collection and % documented!!! |
6 | Are you currently databasing or have you already databased your collections? | All respondents currently databasing, other than CGE who are proposing to. A few smaller herbaria were close to completing documentation. |
7 | What software are you using to database the collections? | The responses showed a large range of different database/documentation software, with Excel/Access being the most frequent. The databases reported were (x= numbers using them):
In-house software x 2 |
9 | Estimated date of completion (if ever!) | Very few clear dates, smaller herbaria able to be more precise. |
10 & 14 | Can the software download into Excel spreadsheets, and, Do you have written data entry standards? | All reported 'yes' or 'indirectly' for downloads into Excel, and all but two for written data entry standards. Assuming that one wanted to, this might allow one central database to be compiled relatively easily from Excel provided that, say, different dates styles, can be put into one uniform system. |
11 | What information are you databasing? | Most were documenting a fairly full list of information from the labels, clearly the more information databased, the greater the use in the long run but the slower the input.
Local government/authority (x0!), was asked as such information is often used for searching for data for local BAPS; it is time-consuming to sort out (vice-county is a proxy). |
12 | How is the information published/to be made available? | It seems almost everybody is disseminating information electronically or on paper to those who want it, and websites are clearly going to be the favoured option soon:
|
13. | Are identifications checked before specimens are databased? | The responses show that the quality of identification information on databased records may be very variable; and this has consequences when they are circulated,
|
15 and 18. | Average number of specimens databased per day if known, and, Do you have staff dedicated to databasing | 30, 50, 50, 75, 125, 150 specimens per day, the rest don't know. The two main factors determining input rates are the Software, and the number of bits of information to document. Also important is the skill of the input staff; 3 had dedicated input staff, 11 either do it as part of other jobs or rely on volunteers. This has a huge implication for the financial resources required. I suspect that most decide what they want to document, and then get on with it! |
16. | % staff time databasing? | Average 15% of staff time spent documenting |
17. | How do you prioritise which specimens are databased? | The larger herbaria tended to gave a long list reflecting multiple usage. For others, priorities were determined by funding, which is often more easily obtained for specific projects!
|
18. | Are you able to offer advice to anyone who needs it? | Nine felt able to offer advice of one sort or another, so there is help out there! |
19 | What help/advice would you like if available? | The need for advice clearly revolves around funding and then websites, and then the more practical aspects of reading labels etc.
|