Skip to content

[Feature] Add Openverse as a Data Source #184

@Babi-B

Description

@Babi-B

Problem

The project currently has GitHub and GCS as automated data sources, but not Openverse. Openverse provides a large collection of openly licensed media, which will greatly enhance the breadth and depth of this data observatory

Description

Openverse aggregates data from several other openly licensed repositories like Flickr. It provides:

Alternatives

  • Work on another source

Additional context

  • Still understanding the project and solving this issue with one simple PR at a time
  • Openverse is compatible with the project structure for tracking CC Legal tools usage

Implementation

  • I will be implementing this feature
  • Focus on a single non-monolithic script scripts/1-fetch/openverse_fetch.py
  • Design script to run from the repository via pipenv
  • include --enable-save and --enable-git for consistent behavior with other scripts

Metadata

Metadata

Assignees

Labels

Projects

Status

Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions