Bio-logger Ethogram Benchmark: A benchmark for computational analysis of animal behavior, using animal-borne tags

dc.contributor.affiliationEarth Species Project - Hoffman, Benjamin
dc.contributor.affiliationEarth Species Project - Cusimano, Maddie
dc.contributor.affiliationUniversidad de León - Baglione, Vittorio
dc.contributor.affiliationUniversidad de León - Canestrari, Daniela
dc.contributor.affiliationCentre national de la recherche scientifique Borea - Chevallier, Damien
dc.contributor.affiliationGeorgia College & State University - DeSantis, Dominic L.
dc.contributor.affiliationAfrican Institute for Mathematical Sciences, Stellenbosch University - Jeantet, Lorène
dc.contributor.affiliationDepartment of Conservation, New Zealand - Ladds, Monique A.
dc.contributor.affiliationOsaka University - Maekawa, Takuya
dc.contributor.affiliationUniversity of Texas, El Paso - Mata-Silva, Vicente
dc.contributor.affiliationUniversidad de León - Moreno-González, Víctor
dc.contributor.affiliationUniversidad de León - Trapote, Eva
dc.contributor.affiliationUniversity of Helsinki - Vainio, Outi
dc.contributor.affiliationTampere University - Vehkaoja, Antti
dc.contributor.affiliationNagoya University - Yoda, Ken
dc.contributor.affiliationEarth Species Project - Zacarian, Katherine
dc.contributor.affiliationUniversity of California, Santa Cruz - Friedlaender, Ari
dc.contributor.authorHoffman, Benjamin
dc.contributor.authorCusimano, Maddie
dc.contributor.authorBaglione, Vittorio
dc.contributor.authorCanestrari, Daniela
dc.contributor.authorChevallier, Damien
dc.contributor.authorDeSantis, Dominic L.
dc.contributor.authorJeantet, Lorène
dc.contributor.authorLadds, Monique A.
dc.contributor.authorMaekawa, Takuya
dc.contributor.authorMata-Silva, Vicente
dc.contributor.authorMoreno-González, Víctor
dc.contributor.authorTrapote, Eva
dc.contributor.authorVainio, Outi
dc.contributor.authorVehkaoja, Antti
dc.contributor.authorYoda, Ken
dc.contributor.authorZacarian, Katherine
dc.contributor.authorFriedlaender, Ari
dc.date.accessioned2025-03-24T15:23:17Z
dc.date.issued2024-04-10
dc.date.issued2024-04-10
dc.descriptionThis repository contains the datasets and experiment results presented in our arxiv paper: B. Hoffman, M. Cusimano, V. Baglione, D. Canestrari, D. Chevallier, D. DeSantis, L. Jeantet, M. Ladds, T. Maekawa, V. Mata-Silva, V. Moreno-González, A. Pagano, E. Trapote, O. Vainio, A. Vehkaoja, K. Yoda, K. Zacarian, A. Friedlaender, "A benchmark for computational analysis of animal behavior, using animal-borne tags," 2023. Standardized code to implement, train, and evaluate models can be found at https://github.com/earthspecies/BEBE/.  Please note the licenses in each dataset folder. Zip folders beginning with "formatted": These are the datasets we used to run the experiments reported in the benchmark paper.  Zip folders beginning with "raw": These are the unprocessed datasets used in BEBE. Code to process these raw datasets into the formatted ones used by BEBE can be found at https://github.com/earthspecies/BEBE-datasets/. Zip folders beginning with "experiments": Results of the cross-validation experiments reported in the paper, as well as hyperparameter optimization. Confusion matrices for all experiments can also be found here. Note that dt, rf, and svm refer to the feature set from Nathan et al., 2012. Results used in Fig. 4 of arxiv paper (deep neural networks vs. classical models){dataset}_ harnet_nogyr{dataset}_CRNN{dataset}_CNN{dataset}_dt{dataset}_rf{dataset}_svm{dataset}_wavelet_dt{dataset}_wavelet_rf{dataset}_wavelet_svm Results used in Fig. 5D of arxiv paper (full data setting)If dataset contains gyroscope (HAR, jeantet_turtles, vehkaoja_dogs):{dataset}_harnet_nogyr{dataset}_harnet_random_nogyr{dataset}_harnet_unfrozen_nogyr{dataset}_RNN_nogyr{dataset}_CRNN_nogyr{dataset}_rf_nogyrOtherwise:{dataset}_harnet_nogyr{dataset}_harnet_unfrozen_nogyr{dataset}_harnet_random_nogyr{dataset}_RNN_nogyr{dataset}_CRNN{dataset}_rf Results used in Fig. 5E of arxiv paper (reduced data setting)If dataset contains gyroscope (HAR, jeantet_turtles, vehkaoja_dogs):{dataset}_harnet_low_data_nogyr{dataset}_harnet_random_low_data_nogyr{dataset}_harnet_unfrozen_low_data_nogyr{dataset}_RNN_low_data_nogyr{dataset}_wavelet_RNN_low_data_nogyr{dataset}_CRNN_low_data_nogyr{dataset}_rf_low_data_nogyr Otherwise:{dataset}_harnet_low_data_nogyr{dataset}_harnet_random_low_data_nogyr{dataset}_harnet_unfrozen_low_data_nogyr{dataset}_RNN_low_data_nogyr{dataset}_wavelet_RNN_low_data_nogyr{dataset}_CRNN_low_data{dataset}_rf_low_data CSV files: we also include summaries of the experimental results in experiments_summary.csv, experiments_by_fold_individual.csv, experiments_by_fold_behavior.csv.  experiments_summary.csv - results averaged over individuals and behavior classesdataset (str): name of datasetexperiment (str): name of model with experiment setting fig4 (bool): True if dataset+experiment was used in figure 4 of arxiv paperfig5d (bool): True if dataset+experiment was used in figure 5d of arxiv paperfig5e (bool): True if dataset+experiment was used in figure 5e of arxiv paperf1_mean (float): mean of macro-averaged F1 score, averaged over individuals in test foldsf1_std (float): standard deviation of macro-averaged F1 score, computed over individuals in test foldsprec_mean, prec_std (float): analogous for precisionrec_mean, rec_std (float): analogous for recallexperiments_by_fold_individual.csv - results per individual in the test foldsdataset (str): name of datasetexperiment (str): name of model with experiment setting fig4 (bool): True if dataset+experiment was used in figure 4 of arxiv paperfig5d (bool): True if dataset+experiment was used in figure 5d of arxiv paperfig5e (bool): True if dataset+experiment was used in figure 5e of arxiv paperfold (int): test fold indexindividual (int): individuals are numbered zero-indexed, starting from fold 1f1 (float): macro-averaged f1 score for this individualprecision (float): macro-averaged precision for this individualrecall (float): macro-averaged recall for this individual experiments_by_fold_behavior.csv - results per behavior class, for each test folddataset (str): name of datasetexperiment (str): name of model with experiment setting fig4 (bool): True if dataset+experiment was used in figure 4 of arxiv paperfig5d (bool): True if dataset+experiment was used in figure 5d of arxiv paperfig5e (bool): True if dataset+experiment was used in figure 5e of arxiv paperfold (int): test fold indexbehavior_class (str): name of behavior classf1 (float): f1 score for this behavior, averaged over individuals in the test foldprecision (float): precision for this behavior, averaged over individuals in the test foldrecall (float): recall for this behavior, averaged over individuals in the test foldtrain_ground_truth_label_counts (int): number of timepoints labeled with this behavior class, in the training set
dc.identifierhttps://doi.org/10.5281/zenodo.10982620
dc.identifier.urihttps://hydatakatalogi-test-24.it.helsinki.fi/handle/123456789/11083
dc.rightsOpen
dc.rights.licensecc-by-4.0
dc.subjectbio-logger
dc.subjectmachine learning
dc.subjectethogram
dc.subjectanimal behavior
dc.subjectacclerometer
dc.subjectbenchmark
dc.titleBio-logger Ethogram Benchmark: A benchmark for computational analysis of animal behavior, using animal-borne tags
dc.typedataset
dc.typedataset

Files

Repositories