BSBI Distribution Database > message board

BSBI TPP dataset - duplicated records

Anything related to particular plant records or other database entries

BSBI TPP dataset - duplicated records

by AndyAmphlett » Sat Mar 31, 2018 10:21 am

Tom,

VCRs are encouraged not to waste time on looking for duplicate records, while trying not to create duplicates if possible. But there are a few datasets from BSBI (not from individual VCRs) that seem to be exacerbating the problem. A fairly recently added dataset is one that appears to have been created specifically for the Threatened Plant Project (TPP).

https://database.bsbi.org/search.php#re ... 9942ef695f

As far as I can see (have only looked at a few examples, mostly in vcs 94 & 96) the records in this dataset are all duplicates, but sometimes with date range errors cf the original record. For example McCallum Webster records in vc96 with a correct date range in the source record, have in the TPP version the end date as 2008, when she died in 1985. Would it be safe to mark the whole dataset as duplicate? That would have to be done centrally, or is it for VCRs to investigate their own vcs?

Thanks,

Andy.
AndyAmphlett
 
Posts: 320
Joined: Fri Nov 23, 2012 9:32 am
name: Andy Amphlett

Re: BSBI TPP dataset - duplicated records

by admin » Sat Mar 31, 2018 11:17 am

Hi Andy,

This dataset is a known problem. At David Pearman's instigation I obtained the underlying data from BRC and have made extensive corrections to dates.

There will still be duplicates after the reload, but please wait until then before taking any action, because the refreshed data will definitely be better than the current situation. I hope to reload the set soon - hopefully next week.
Tom Humphrey
Database Officer, Botanical Society of Britain and Ireland (BSBI)
c/o Centre for Ecology and Hydrology,Maclean Building, Crowmarsh Gifford, Wallingford, Oxon, OX10 8BB, UK.
tom.humphrey@bsbi.org
User avatar
admin
 
Posts: 369
Joined: Tue Nov 20, 2012 4:16 pm
name: Tom Humphrey


Return to Records and data

cron