LibraryOfCongress/concordia

Welcome to Concordia

Concordia is a platform developed by the Library of Congress (LOC) for crowdsourcing the transcription and tagging of text in digitized images, with the dual goals of collection enhancement and public engagement. Concordia is a user-centered project built on the principles of trust and approachability. Read our full design principles here. Learn more about the Concordia development process in this Code4Lib article.

LOC launched the first iteration of Concordia as By the People at crowd.loc.gov in October 2018.

The Library of Congress publishes transcriptions created by By the People volunteers on loc.gov to improve search, readability, and access to handwritten and typed documents. Individual transcriptions are published alongside the transcribed images in digital collections, and transcriptions are also published in bulk as datasets. Learn more about how we publish transcriptions.

Concordia code and the By the People transcriptions are released into the public domain. Anyone is free to use or reuse the data. More info on our licensing page.

As of May 2022, the Library of Congress Concordia development team has moved issues out of GitHub to an internal system due to reporting needs. Open GitHub issue tickets may not be active or up-to-date. We continue to publish our code here as it is released. Learn more about How We Work.

Concordia and By the People are supported by the National Digital Library Trust Fund.

What Concordia does

The application invites volunteers to transcribe and tag digitized images of manuscript and typed materials from the Library’s collections. All transcriptions are made by volunteers and reviewed by volunteers. It takes at least one volunteer to transcribe a page and at least one other volunteer to review and mark it complete. Some complex documents may pass through both transcription and review many times before they are accepted as complete by a volunteer.
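The transcribe-then-review cycle described above can be pictured as a small state machine. The following is a minimal sketch only, not Concordia's actual data model: the `Status` values, the `Page` class, and its `submit` and `review` methods are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Status(Enum):
    # Hypothetical status names, chosen for this sketch only.
    NOT_STARTED = "not started"
    IN_PROGRESS = "in progress"
    NEEDS_REVIEW = "needs review"
    COMPLETED = "completed"


@dataclass
class Page:
    text: str = ""
    status: Status = Status.NOT_STARTED
    transcriber: Optional[str] = None

    def submit(self, volunteer: str, text: str) -> None:
        """A volunteer transcribes (or corrects) the page and submits it for review."""
        if self.status not in (Status.NOT_STARTED, Status.IN_PROGRESS):
            raise ValueError("page is not open for transcription")
        self.text = text
        self.transcriber = volunteer
        self.status = Status.NEEDS_REVIEW

    def review(self, reviewer: str, accept: bool) -> None:
        """A different volunteer accepts the page or sends it back for more work."""
        if self.status is not Status.NEEDS_REVIEW:
            raise ValueError("page is not awaiting review")
        if reviewer == self.transcriber:
            raise PermissionError("a different volunteer must review the page")
        # A rejected page returns to transcription, so it may cycle many times.
        self.status = Status.COMPLETED if accept else Status.IN_PROGRESS
```

In this sketch, a page only reaches `COMPLETED` when someone other than the most recent transcriber accepts it, which mirrors the "at least one other volunteer" rule above.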

Concordia is a containerized Python-Django-Postgres-etc web application. The Library hosts its instance in the cloud.

Concordia uses the publicly available loc.gov API to retrieve collection metadata and images in JPEG format and saves copies for use in Concordia. Completed transcriptions can be exported from the application as a single CSV or as individual TXT files in a BagIt bag.
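To make the two export shapes concrete, here is a minimal standard-library sketch. It is not Concordia's exporter: the function names `export_csv` and `export_bag`, the CSV column names, and the input mapping of asset name to transcription text are all assumptions made for illustration. The bag layout (a `bagit.txt` declaration, a `data/` payload directory, and a checksum manifest) follows the BagIt specification; asset names are assumed to be safe to use as file names.

```python
import csv
import hashlib
from pathlib import Path


def export_csv(transcriptions: dict, csv_path: Path) -> Path:
    """Write all transcriptions to a single CSV (column names are hypothetical)."""
    with open(csv_path, "w", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh)
        writer.writerow(["asset", "transcription"])
        writer.writerows(sorted(transcriptions.items()))
    return csv_path


def export_bag(transcriptions: dict, bag_dir: Path) -> Path:
    """Write one TXT file per transcription inside a minimal BagIt bag."""
    data_dir = bag_dir / "data"
    data_dir.mkdir(parents=True, exist_ok=True)

    # Payload: one UTF-8 text file per asset.
    for name, text in transcriptions.items():
        (data_dir / f"{name}.txt").write_bytes(text.encode("utf-8"))

    # Bag declaration required by the BagIt specification.
    declaration = "BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n"
    (bag_dir / "bagit.txt").write_bytes(declaration.encode("utf-8"))

    # Payload manifest: one "<checksum>  <path>" line per payload file.
    lines = []
    for path in sorted(data_dir.iterdir()):
        digest = hashlib.sha256(path.read_bytes()).hexdigest()
        lines.append(f"{digest}  data/{path.name}\n")
    (bag_dir / "manifest-sha256.txt").write_bytes("".join(lines).encode("utf-8"))
    return bag_dir
```

A call such as `export_bag({"page-1": "Dear Sir"}, Path("my-bag"))` would produce `my-bag/data/page-1.txt` alongside the two tag files. A production exporter would more likely rely on an established BagIt library than on hand-written manifests.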

Want to use or reuse our code?

For more on our tech stack and to learn how to set up Concordia on your computer, check out the For Developers page.

Want to help?

We're excited that you want to be part of Concordia! Here are two ways to contribute:

1. Report bugs by submitting an issue. If you are reporting a bug, please include:

  • Your operating system name and version.
  • Any details about your local setup that might be helpful in troubleshooting.
  • Detailed steps to reproduce the bug.

2. Create an issue to give feedback or suggest a new feature. The best way to give feedback is to file an issue at https://github.com/LibraryOfCongress/concordia/issues. If you are proposing a feature:

  • Explain in detail how it would work.
  • Explain how it would serve Concordia via a user story.
  • Keep the scope as narrow as possible, to make it easier to implement.

If you use or build on our code, we'd love to hear from you! Contact us at ask.loc.gov.