🚀 New! Mozilla x EleutherAI Blueprints for easier & open dataset creation

Two powerful tools for developers:

1️⃣ Audio Transcription using privacy-focused Whisper models

2️⃣ Document Conversion to Markdown for building open-text datasets

Available now on Mozilla.ai Blueprints hub: blueprints.mozilla.ai/

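This reads documents saved as PDF or DOCX with OCR and uses AI to turn them into reasonably clean Markdown, JSON, or plain text. Could be quietly handy (though the OCR in the demo on the page doesn't seem to be working correctly).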
blueprints.mozilla.ai/all-blueprints/convert-documents-to-markdown-format

RE:
mastodon.social/users/MozillaAI/statuses/114382740175747816
