For heritage institutions

Artificial intelligence for libraries, archives and heritage

Eight AI tools covering the 8 strategic objectives published by BNElab, integrable under a single ecosystem. Built in Europe, with data on European soil, by Videoconversion Digital Lab — over 20 years in documentary digitization.

Request institutional demo Download technical dossier

BNElab's 8 AI objectives, covered today

BNElab publishes eight strategic AI work areas. For each one, we have an operational tool with comparable world references.

Natural language querying

Multilingual RAG on documentary corpora with intent detection, parallel sub-queries and mandatory page-referenced citations. 9 native languages.

Our solution:VerbaDoc Pro Annalysis Media Annalogica

World reference:Primo Research Assistant · Overdrive Inspire Me

OCR and manuscript transcription

Classic OCR + neural HTR with Gemini Vision. Automatic detection of document type, archival series and geographic location.

Our solution:Scriptorium IA VerbaDoc Pro Annalogica

World reference:British Library · Transkribus

Image element identification and extraction

Detection of seals, maps, tables, engravings and miniatures. Visual embeddings with HNSW index for similarity search. Upscaling and restoration with dedicated GPU.

Our solution:AnnaPhoto Scriptorium IA cadex.studio

World reference:British Library MapReader · KB Bildsök

Text entity extraction

NER with confidence score: people, places, organizations, dates, events. Ready to integrate specialized models for historical Spanish.

Our solution:Scriptorium IA VerbaDoc Pro

World reference:Finlandia Annif · DNB NLP indexación

Music score recognition

Functional H-OMR prototype — one of the BNElab objectives that very few institutions worldwide address operationally.

Our solution:Partituras Musicales (H-OMR)

World reference:Pocas bibliotecas en el mundo lo cubren

Assistant chatbots

Natural language conversation over the document corpus with context memory, mandatory source citations and 24/7 availability.

Our solution:VerbaDoc Pro Annalysis Media

World reference:Primo Research Assistant

Automated cataloguing

Automatic metadata generation: Dublin Core 15 elements, MODS, METS 2.x, PREMIS v3, MIX. Document series and type suggestions with humans-in-the-loop.

Our solution:AnnaMETS Scriptorium IA VerbaDoc Pro

World reference:Ex Libris Alma Specto · Library of Congress ECD (95 % F1)

Automated classification

Classification by diplomatic type, series, language, period and geography. Human review of high-confidence labels.

Our solution:Scriptorium IA VerbaDoc Pro

World reference:Proyecto Languid (British Library — 3 M registros)

What makes us different

Advantages no international provider combines under one roof.

20+ years of experience

Videoconversion Digital Lab (Barcelona) has been digitizing documentary heritage for more than two decades. We didn't arrive at AI from a startup — we apply it to a craft we already master.

100% data on European soil

Neon PostgreSQL and Vercel hosted in the EU. Native GDPR and Spanish ENS compliance. No transatlantic transfers, no legal surprises.

Integration with MarIA (BNE)

Designed to incorporate MarIA — the Spanish language model trained by BNE on 135 billion words with MareNostrum — as an optional engine for NER and linguistic analysis.

7 material types

Text, manuscript, image, plan, map, audio, video and score — all processable in one coherent ecosystem. Other providers cover one or two.

Multi-LLM validation

Architecture with Gemini + GPT-4o + Claude voting on critical results. Keeps hallucinations below the 95% F1 threshold set by the Library of Congress.

Music score recognition

H-OMR prototype in early production — one of BNElab's objectives that very few institutions worldwide have addressed at product level.

Flexibility by institution size

Same ecosystem, three deployment sizes. From a municipal library to a national one.

Municipal · University

Up to 100,000 documents

SaaS on the mediasolam platform
Immediate deployment, no in-house infrastructure
Monthly payment, no commitment
Municipal libraries, diocesan archives, foundations

Regional · Provincial

Up to 2 million documents

Dedicated EU deployment with exclusive database
SLA and service-level support
Integration with existing catalogue (Koha, Alma)
Regional libraries, provincial archives

BNElab

National · BNElab Edition

No operational limit

Hybrid cloud + on-premises architecture
Integration with MarIA and the institutional OAIS repository
AI policy co-drafted with the institution
Dedicated L3 support and joint roadmap

The 3 transversal principles BNElab requires

Ethical, sustainable and transparent use of AI — not as a promise, but as architecture.

Ethical

Mandatory humans-in-the-loop for all automated decisions. Visible confidence scores. Right to explanation and human correction at all times. Formal responsible AI policy, inspired by the British Library's FRAIM project.

Sustainable

Energy consumption measured per model call. Aggressive embedding caching. Smaller models used when sufficient. Quarterly footprint dashboard. Compatible with on-premises MarIA to reduce data transport.

Transparent

Complete audit trail: which model processed which document, when and with which prompt. Versioned prompts. Documented reproducibility. Generated metadata in open standards (Dublin Core, MODS, METS, MARC21).

Aligned with the world's references

The leading national libraries and European networks of AI applied to documentary heritage are our reference coordinates.

CENL AI in Libraries

AI in Libraries working group of the Conference of European National Librarians. BNE is an active participant.

AI4LAM · Fantastic Futures

International community of AI applied to libraries, archives and museums. Organizers of the Fantastic Futures conferences.

ALT-EDIC

European alliance coordinated by France, with 17 member states including Spain, for language technologies.

Library of Congress ECD

Exploring Computational Description — applying transformers to over 23,000 ebooks with a 95% F1 quality threshold.

British Library MapReader

Computer vision model for historical maps, developed with the Alan Turing Institute.

KB Suecia · KB-BERT

KB-BERT, the Swedish National Library's language model, trained on on-premises Nvidia DGX with 500 years of Swedish text.

Finlandia · Annif

Automated subject indexing microservice in production at the National Library of Finland.

BNE MarIA

First massive AI system expert in the Spanish language — 135 billion words, trained on MareNostrum.

The ecosystem live

Eight applications in production. You can try them.

VerbaDoc Pro

Massive documentary extraction, multilingual RAG and AI validation.

Open application →

Scriptorium IA

HTR, NER and cataloguing of manuscripts and documentary heritage.

Open application →

AnnaPhoto

Photographic archive with semantic search and AI restoration.

Open application →

AnnaMETS

Metadata generator for METS, Dublin Core, MIX and PREMIS v3.

Open application →

Annalogica

Audio and video transcription and analysis in 9 languages.

Open application →

Annalysis Media

Visual and semantic analysis of audiovisual collections.

Open application →

cadex.studio

Data extraction from plans, maps and technical drawings with triple LLM validation.

Open application →

Partituras Musicales

H-OMR prototype for optical music score recognition (beta).

Open application →

Let's talk about your institution

All apps are operational and can be tested with your own digitized collections. Arrange an institutional demo with our team.

Request meeting Email info@videoconversion.es