Mycroft Sweeper

Desktop application for document anonymization

Locally. Using AI. In Polish.

Protect personal data and comply with GDPR with a modern desktop application that automates the document anonymization process. It operates entirely locally, without uploading data to the cloud, using artificial intelligence (AI) and optical character recognition (OCR) to quickly and accurately detect personal data – in both editable and scanned documents.

The application was designed with the Polish language in mind – it effectively recognizes personal data in various grammatical forms and contexts. Thanks to a specialized detection model, it has a low false positive rate, significantly reducing the need for manual correction.

Features that automate work, ensuring data security in your organization.

✅ Work locally – data never leaves your computer. The application runs entirely locally, without a connection to the cloud. All processing operations are performed on the user's device, ensuring maximum security and document confidentiality.

✅ Document support and OCR. Ability to load PDF, JPG, DOCX, and other files. The built-in OCR module (optical character recognition) also allows for the analysis of documents without a text layer - such as scans or photos.

✅ Intelligent data identification

The use of AI allows for automatic recognition of data such as:

  • names and surnames,
  • PESEL, NIP, KRS, REGON numbers,
  • bank account numbers,
  • addresses,
  • dates,
  • land and mortgage register numbers and other identifying data. The application has been adapted to the Polish language and can handle inflected forms of words by analyzing the context of the statement and the structure of the document.

✅ Interactive and precise interface

  • Data panel on the left - a table with detected information, assigned categories, with the option of checking the data to be anonymized with a checkbox.
  • Document preview on the right - allows:
  • Clicking on words in a document, even if it was loaded as an image or scan, to manually add or exclude them from the anonymization list,
  • Drawing rectangles to hide larger parts of the document, including stamps, handwritten signatures, maps, charts, and other graphic elements.

✅ Exporting and saving results

  • Anonymized PDF document generated with one click,
  • Exporting the list of detected data to a JSON file – useful for documentation, auditing, or further processing.

✅ High performance and low cost of use

The application analyzes up to 100-page documents in about 3 minutes. The AI algorithms used have been optimized to work even on computers with limited computing resources – without the need to invest in expensive servers or specialized IT infrastructure. This solution is efficient, available, and scalable.

Faster, safer and cheaper – a local alternative to cloud-based solutions.

Feature
Mycroft Sweeper (AI + OCR)
AI-based solutions
GPT/API integrations

Document processing speed

~2 sec/1 page

~3 min/100 pages

~18 sec./1 page

~30 min/100 pages

~5–15 sec/1 page

~8–25 min/100 pages

Requires data transfer to the cloud

No

Yes

Yes

Operational stability

High - works offline

Limited by API and traffic

Depends on load and tokens

Local scalability

Yes

Low

Low

Operating cost

Fixed, low – no API/cloud fees

Variable – document/tokens fees

High – cost per request/token/limit

Why does a local solution win?

  • Extremely low operating costs – no fees for document processing, API tokens, or data transfer.
  • Processing speed – even 100-page document scans in minutes, not hours.
  • Security and GDPR compliance – data is not sent outside your organization.
  • Internal scalability – runs on standard hardware, without expensive servers or GPUs.
  • Ready to operate offline – even in environments with increased security requirements (e.g., public institutions, law firms, the financial sector).

Mycroft Sweeper supports public institutions and private companies where secure and compliant document anonymization is a daily necessity.

  • Public offices and institutions – responsible for publishing documents in the Public Information Bulletin (BIP) and making public information available.
  • Law and notary offices – processing documents containing clients' personal data.
  • Financial and banking sectors – requiring compliance with data protection regulations.
  • Healthcare – processing medical data and information about patients' health.
  • HR and human resources departments – managing employee documentation containing personal data.
  • Companies processing documents containing client data – e.g., in recruitment processes, customer service, or marketing.
  • Non-governmental organizations and scientific institutions – processing sensitive data as part of research, analyses, and ongoing projects.
  • IT and cybersecurity companies – offering services related to data processing and protection.

For organizations looking for tailor-made solutions.

  • Server and API implementation

    Integrate with existing IT infrastructure.

    Available in the local network (LAN) or as a service component (API) for integration with EZD, DMS, eBOK or scanning systems.

  • Custom data types

    Add detection of data specific to your organization.

    Case numbers, project codes, patient IDs, album numbers, document references, and more.

  • Document type detection

    Apply appropriate anonymization rules for different documents.

    Recognize the document (e.g. contract, statement, decision) and apply appropriate context rules.

Flat fee, zero per-page costs. AI, OCR, updates, and technical support included.

License
Net price
What do you gain?

Mycroft Sweeper (1 seat, 12 months)

700 PLN / year

  • • OCR + AI anonymization (PDF, DOCX, scans)
  • • No page and file limits
  • • Updates and technical support
  • • Local installation – data does not leave the computer

Sector and volume discounts

We offer preferential pricing for:

  • public sector entities,
  • educational institutions,
  • non-governmental organizations.

For larger numbers of licenses or comprehensive implementations, we provide additional volume discounts, making the total project cost even more favorable.

Contact us to receive an individual quote tailored to the needs and scale of your organization.

PrivacyTerms of Service
© Mycroft Solutions Sp. z o.o.