OpenSciMetrics (OSM) applies NLP and LLM-based metrics and indicators related to transparency, data sharing, rigor, and open science on biomedical publications.
The Open Science Metrics Dashboard visualizes data sharing and code sharing trends across biomedical research funders and journals, based on analysis of PubMed Central publications.
Live Dashboard: https://www.opensciencemetrics.org
The repository also contains a command line tool for processing individual PDFs and XMLs. Note: This tool is not currently being actively maintained.
With docker-compose and python >=3.11 installed:
pip install .
osm -f path/to/pdf-or-xml -u uuid
For processing many files, start docker-compose dependencies separately:
docker compose up # In one terminal
osm -f path/to/pdf-or-xml -u uuid --user-managed-compose # In another terminal
To set up a development environment:
pip install -e .
docker compose -f compose.yaml -f compose.development.override.yaml up --build
In another terminal:
export OSM_API="http://localhost:80"
osm -f path/to/pdf-or-xml -u uuid --user-managed-compose
Pre-commit runs checks on every commit. To install:
pip install pre-commit
pre-commit install
On Apple silicon, you must use emulation:
export DOCKER_DEFAULT_PLATFORM=linux/amd64
docker pull mongo:4.4.6
Note: PDF parsing does not work on Apple silicon.