Skip to content
Crowdsourcing & Citizen History

Choose HTR when your material is voluminous and reasonably consistent and speed matters; choose crowdsourcing when the handwriting is varied or damaged, when public engagement is a goal in itself, or when the collection is too small to justify training a model. In practice the strongest option is often a hybrid: run HTR first and have volunteers correct it. The right answer is driven by your sources, your scale, and what you want the project to achieve beyond a transcript.

What is the core trade-off?

HTR (Handwritten Text Recognition) is fast and tireless but needs consistent input and training effort; crowdsourcing is flexible and engaging but needs people, coordination and reconciliation. They optimise for different things:

FactorHTRCrowdsourcing
Speed at scaleVery fastSlow, paced by volunteers
Handles varied handsPoorly without per-hand modelsVery well
Setup effortGround truth + trainingPlatform + guidelines + community
Public engagementNoneHigh — a goal in itself
Rich indexingLimitedExcellent (subject tagging)
Marginal cost per pageNear zero once trainedVolunteer time, every page

When is HTR the right call?

Reach for HTR when you have thousands of pages of consistent handwriting — a single scribe's letter-books, an institution's minute series, a census in one clerk's hand. Tools like Transkribus and eScriptorium train a model from a few dozen ground-truth pages and then transcribe the rest at machine speed. The break-even is roughly: if producing ground truth and training costs less effort than the collection would take to crowdsource, HTR wins.

text
Single-scribe ledger, 8,000 pp, consistent hand   → HTR
Mixed correspondence, 200 hands, 1,500 pp          → crowdsourcing
3,000 pp typescript with annotations               → OCR/HTR + human correction

When is crowdsourcing the right call?

Choose crowdsourcing when any of these hold:

  • The handwriting is highly varied (many scribes, centuries, languages).
  • The material is damaged, faded or marginal-heavy, where humans read context machines miss.
  • Public engagement is part of the mission — you want a community around the records.
  • You need rich subject indexing of people and places, which volunteers do naturally.
  • The collection is too small to repay model training.

Crowdsourcing also produces something HTR cannot: a community of people who now care about your archive.

Why is the hybrid usually best?

The false choice is "machines or people." A human-in-the-loop workflow runs HTR first and presents the output to volunteers for correction. Correcting is much faster than transcribing from scratch, the volunteers catch what the model gets wrong, and — if you feed corrections back as new ground truth — the model improves over time:

text
1. HTR transcribes the batch (fast, imperfect)
2. Volunteers correct the output (engaging, faster than blank-page)
3. Corrections become new ground truth
4. Retrain → model improves → less correction next batch

Transkribus and similar platforms support exactly this loop, and it often delivers the best accuracy-per-effort.

How do you decide in practice?

Work through four questions in order:

  1. How consistent is the hand? Very consistent points to HTR; very varied points to crowdsourcing.
  2. How much is there? Thousands of consistent pages favour the upfront cost of a model.
  3. Is engagement a goal? If yes, crowdsourcing earns its keep beyond the transcript.
  4. Do you have ground-truth and ML capacity? If not, the hybrid lets you start with HTR and add people.

Answer "consistent, large, have ML capacity" and choose HTR; answer "varied, modest, engagement matters" and choose crowdsourcing. Anything in between favours the hybrid.

What about cost — is HTR free?

No. HTR moves cost rather than removing it: you pay in compute, software (Transkribus uses credit-based pricing), and the expert time to produce ground truth and validate models. Crowdsourcing moves cost into community building and reconciliation. Budget honestly for whichever you pick.

Key Takeaways

  • HTR suits large, consistent collections; crowdsourcing suits varied, damaged or engagement-driven ones.
  • The hybrid — HTR then volunteer correction — is often the best accuracy-per-effort.
  • Crowdsourcing uniquely produces rich subject indexing and an engaged community.
  • Training an HTR model pays off at roughly thousands of consistent pages.
  • Neither approach is free; both move cost rather than removing it.
  • Decide using hand consistency, scale, engagement goals and ML capacity.

Frequently Asked Questions

Is HTR more accurate than crowdsourced transcription?

It depends on the hand. On consistent single-scribe material with a trained model, HTR can match or beat volunteers and is far faster; on highly varied or damaged hands, careful human transcribers still win.

When is crowdsourcing the better choice?

When your material is highly varied, when public engagement is itself a goal, when you need rich subject indexing, or when the collection is too small to justify training an HTR model.

Can I combine HTR and crowdsourcing?

Yes, and it is often the best option. Run HTR first, then have volunteers correct the output — a 'human-in-the-loop' workflow that is far faster than transcribing from scratch and improves on raw HTR accuracy.

How much material justifies training an HTR model?

As a rough guide, thousands of pages of reasonably consistent handwriting. Below that, the effort of producing ground truth and training rarely pays back versus crowdsourcing or manual transcription.

What does HTR cost compared with crowdsourcing?

HTR shifts cost to compute, software and the expert time to make ground truth and train models. Crowdsourcing shifts cost to community building, platform setup and reconciliation. Neither is simply free.