Importing Existing Data: Herbarium

After deciding that we should start building the portal using the data from an existing Symbiota installation, I concentrated my efforts into importing the databases from the Herbarium into the prototype. It turned out to be a task more complicated than I had anticipated, but it is finally done. The prototype uses the current Symbiota core, which is newer than the one used by the Herbarium. A few compatibility issues arose, but everything is working again now. The most important component of this import was to make sure that the taxonomic thesaurus was updated and contained all taxa represented in the collection. Figuring out the table strcuture in the two Symbiota installations (the source and the target) was essential to maintain data quality. Since Symbiota’s documentation does not explicit the structure of tables or relationship between them, I will describe them here, in hope that this information can be useful again someday.

While accessing the database for a given Symbiota installation, if one wants to gather, in a single view, the taxonomic names and ranks, and also their parent ranks and names, three tables should be queried:

A few queries that have been generated on the source database are listed:

The queries were then exported to .csv files, zipped, and imported into the prototype using the Batch Upload a Taxonomic Data File option in the Sitemap (which points to the ../taxa/admin/taxaloader.php page).