Mycroft Sweeper

Desktop application for document anonymization

Locally. Using AI. In Polish.

Protect personal data and comply with GDPR with a modern desktop application that automates the document anonymization process. It operates entirely locally, without uploading data to the cloud, using artificial intelligence (AI) and optical character recognition (OCR) to quickly and accurately detect personal data – in both editable and scanned documents.

Features that automate work, ensuring data security in your organization.

✅ Work locally – data never leaves your computer. The application runs entirely locally, without a connection to the cloud. All processing operations are performed on the user's device, ensuring maximum security and document confidentiality.

✅ Document support and OCR. Ability to load PDF, JPG, and other files. The built-in OCR module (optical character recognition) also allows for the analysis of documents without a text layer - such as scans or photos.

✅ Intelligent data identification

The use of AI allows for automatic recognition of data such as:

  • names and surnames,
  • PESEL, NIP, KRS, REGON numbers,
  • bank account numbers,
  • addresses,
  • dates,
  • land and mortgage register numbers and other identifying data. The application has been adapted to the Polish language and can handle inflected forms of words by analyzing the context of the statement and the structure of the document.

✅ Interactive and precise interface

  • Data panel on the left - a table with detected information, assigned categories, with the option of checking the data to be anonymized with a checkbox.
  • Document preview on the right - allows:
  • Clicking on words in a document, even if it was loaded as an image or scan, to manually add or exclude them from the anonymization list,
  • Drawing rectangles to hide larger parts of the document, including stamps, handwritten signatures, maps, charts, and other graphic elements.

✅ Exporting and saving results

  • Anonymized PDF document generated with one click,
  • Exporting the list of detected data to a JSON file – useful for documentation, auditing, or further processing.

✅ High performance and low cost of use

The application analyzes up to 100-page documents in about 3 minutes. The AI algorithms used have been optimized to work even on computers with limited computing resources – without the need to invest in expensive servers or specialized IT infrastructure. This solution is efficient, available, and scalable.

Faster, safer and cheaper – a local alternative to cloud-based solutions.

Feature
Mycroft Sweeper (AI + OCR)
AI-based solutions
GPT/API integrations

Document processing speed

~2 sec/1 page

~3 min/100 pages

~18 sec./1 page

~30 min/100 pages

~5–15 sec/1 page

~8–25 min/100 pages

Requires data transfer to the cloud

No

Yes

Yes

Operational stability

High - works offline

Limited by API and traffic

Depends on load and tokens

Local scalability

Yes

Low

Low

Operating cost

Fixed, low – no API/cloud fees

Variable – document/tokens fees

High – cost per request/token/limit

Why does a local solution win?

  • Extremely low operating costs – no fees for document processing, API tokens, or data transfer.
  • Processing speed – even 100-page document scans in minutes, not hours.
  • Security and GDPR compliance – data is not sent outside your organization.
  • Internal scalability – runs on standard hardware, without expensive servers or GPUs.
  • Ready to operate offline – even in environments with increased security requirements (e.g., public institutions, law firms, the financial sector).

Mycroft Sweeper supports public institutions and private companies where secure and compliant document anonymization is a daily necessity.

  • Public offices and institutions – responsible for publishing documents in the Public Information Bulletin (BIP) and making public information available.
  • Law and notary offices – processing documents containing clients' personal data.
  • Financial and banking sectors – requiring compliance with data protection regulations.
  • Healthcare – processing medical data and information about patients' health.
  • HR and human resources departments – managing employee documentation containing personal data.
  • Companies processing documents containing client data – e.g., in recruitment processes, customer service, or marketing.
  • Non-governmental organizations and scientific institutions – processing sensitive data as part of research, analyses, and ongoing projects.
  • IT and cybersecurity companies – offering services related to data processing and protection.

For organizations looking for tailor-made solutions.

  • Server and API implementation

    Integrate with existing IT infrastructure.

    Available in the local network (LAN) or as a service component (API) for integration with EZD, DMS, eBOK or scanning systems.

  • Custom data types

    Add detection of data specific to your organization.

    Case numbers, project codes, patient IDs, album numbers, document references, and more.

  • Document type detection

    Apply appropriate anonymization rules for different documents.

    Recognize the document (e.g. contract, statement, decision) and apply appropriate context rules.

Mycroft Sweeper Annual License Pricing

Private Sector

(net prices, VAT 23%; 12-month license with updates and support)

Single license (1 seat) – 999.00 PLN net / seat

Package
Number of Licenses
Package Price
Average Cost / Seat

Five

5

4,495.50 PLN

899.10 PLN (-10%)

Team

10

7,992.00 PLN

799.20 PLN (-20%)

Team L

25

17,481.25 PLN

699.25 PLN (-30%)

Team XL

50

29,970.00 PLN

599.40 PLN (-40%)

Team MAX

100

49,950.00 PLN

499.50 PLN (-50%)

Unlimited

Unlimited within the organization

Custom pricing

Custom pricing

Contact us to receive an individual quote tailored to the needs and scale of your organization.

PrivacyTerms of Service
© Mycroft Solutions Sp. z o.o.