Mycroft Sweeper

Desktop application for document anonymization

Locally. Using AI. In Polish.

Protect personal data and stay GDPR-compliant with a desktop application that automates document anonymization. It runs entirely locally, uploading nothing to the cloud, and uses artificial intelligence (AI) and optical character recognition (OCR) to detect personal data quickly and accurately in both editable and scanned documents.

Features that automate your work and keep your organization's data secure.

✅ Work locally – data never leaves your computer. The application needs no connection to the cloud: all processing is performed on the user's device, which keeps documents confidential.

✅ Document support and OCR. Load PDF, JPG, and other files. The built-in OCR module also analyzes documents that have no text layer, such as scans or photos.

✅ Intelligent data identification

AI automatically recognizes data such as:

  • names and surnames,
  • PESEL, NIP, KRS, REGON numbers,
  • bank account numbers,
  • addresses,
  • dates,
  • land and mortgage register numbers and other identifying data.

The application is adapted to the Polish language: it handles inflected word forms by analyzing the context of a statement and the structure of the document.
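Several of the identifiers listed above carry a check digit, so a candidate match can be sanity-checked independently of any AI model. The following is a minimal Python sketch of the publicly documented PESEL and NIP checksum rules. It is an illustration only and does not describe how Mycroft Sweeper detects data internally; the function names are hypothetical.

```python
def _is_ascii_digits(value: str) -> bool:
    """True if the string is non-empty and consists only of ASCII digits 0-9."""
    return value.isascii() and value.isdigit()


def is_valid_pesel(value: str) -> bool:
    """Check the PESEL check digit (11 digits, weights 1,3,7,9 repeated)."""
    if len(value) != 11 or not _is_ascii_digits(value):
        return False
    weights = (1, 3, 7, 9, 1, 3, 7, 9, 1, 3)
    total = sum(w * int(d) for w, d in zip(weights, value))
    # The 11th digit must bring the weighted sum up to a multiple of 10.
    return (10 - total % 10) % 10 == int(value[10])


def is_valid_nip(value: str) -> bool:
    """Check the NIP check digit (10 digits, optional hyphens)."""
    digits = value.replace("-", "")
    if len(digits) != 10 or not _is_ascii_digits(digits):
        return False
    weights = (6, 5, 7, 2, 3, 4, 5, 6, 7)
    check = sum(w * int(d) for w, d in zip(weights, digits)) % 11
    # A remainder of 10 can never equal a single digit, so it is rejected.
    return check == int(digits[9])
```

A checksum of this kind only filters out digit strings that cannot be valid identifiers; it says nothing about whether a number actually belongs to a person or company.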

✅ Interactive and precise interface

  • Data panel on the left: a table of detected items with their assigned categories, each with a checkbox to select it for anonymization.
  • Document preview on the right, where you can:
    • click words in the document, even if it was loaded as an image or scan, to manually add them to or exclude them from the anonymization list,
    • draw rectangles to hide larger parts of the document, including stamps, handwritten signatures, maps, charts, and other graphic elements.

✅ Exporting and saving results

  • Anonymized PDF document generated with one click,
  • Exporting the list of detected data to a JSON file – useful for documentation, auditing, or further processing.

✅ High performance and low cost of use

The application analyzes a 100-page document in about 3 minutes. The AI algorithms have been optimized to run even on computers with limited computing resources, with no need to invest in expensive servers or specialized IT infrastructure. The solution is efficient, accessible, and scalable.
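As a rough planning aid, the processing time for a document can be estimated from a per-page rate. The sketch below assumes the approximate figure of 2 seconds per page quoted in the comparison on this page; the function name is hypothetical, and real throughput will depend on hardware and document content.

```python
def estimated_minutes(pages: int, seconds_per_page: float = 2.0) -> float:
    """Estimate processing time in minutes for a document of `pages` pages.

    The default rate of 2 seconds per page is the approximate figure quoted
    on this page, not a measured or guaranteed value.
    """
    if pages < 0:
        raise ValueError("pages must be non-negative")
    return pages * seconds_per_page / 60


# A 100-page document at ~2 sec/page comes out at roughly 3.3 minutes,
# in line with the "about 3 minutes" stated above.
print(f"{estimated_minutes(100):.1f} min")
```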

Faster, safer and cheaper – a local alternative to cloud-based solutions.

| Feature | Mycroft Sweeper (AI + OCR) | AI-based solutions | GPT/API integrations |
| --- | --- | --- | --- |
| Document processing speed | ~2 sec/page; ~3 min/100 pages | ~18 sec/page; ~30 min/100 pages | ~5–15 sec/page; ~8–25 min/100 pages |
| Requires data transfer to the cloud | No | Yes | Yes |
| Operational stability | High – works offline | Limited by API and traffic | Depends on load and tokens |
| Local scalability | Yes | Low | Low |
| Operating cost | Fixed, low – no API/cloud fees | Variable – per-document/token fees | High – cost per request/token/limit |

Why does a local solution win?

  • Extremely low operating costs – no fees for document processing, API tokens, or data transfer.
  • Processing speed – even 100-page scanned documents are processed in minutes, not hours.
  • Security and GDPR compliance – data is not sent outside your organization.
  • Internal scalability – runs on standard hardware, without expensive servers or GPUs.
  • Ready to operate offline – even in environments with increased security requirements (e.g., public institutions, law firms, the financial sector).

Mycroft Sweeper supports public institutions and private companies where secure and compliant document anonymization is a daily necessity.

  • Public offices and institutions – responsible for publishing documents in the Public Information Bulletin (BIP) and making public information available.
  • Law firms and notary offices – processing documents containing clients' personal data.
  • Financial and banking sectors – requiring compliance with data protection regulations.
  • Healthcare – processing medical data and information about patients' health.
  • HR departments – managing employee documentation containing personal data.
  • Companies processing documents containing client data – e.g., in recruitment processes, customer service, or marketing.
  • Non-governmental organizations and scientific institutions – processing sensitive data as part of research, analyses, and ongoing projects.
  • IT and cybersecurity companies – offering services related to data processing and protection.

For organizations looking for tailor-made solutions.

  • Server and API implementation

    Integrate with existing IT infrastructure.

    Available on the local network (LAN) or as a service component (API) for integration with EZD, DMS, eBOK, or scanning systems.

  • Custom data types

    Add detection of data specific to your organization.

    Case numbers, project codes, patient IDs, student ID (album) numbers, document references, and more.

  • Document type detection

    Apply appropriate anonymization rules for different documents.

    Recognize the document (e.g. contract, statement, decision) and apply appropriate context rules.

Mycroft Sweeper Annual License Pricing

Private Sector

(net prices, VAT 23%; 12-month license with updates and support)

Single license (1 seat) – 999.00 PLN net / seat

| Package | Number of Licenses | Package Price | Average Cost / Seat |
| --- | --- | --- | --- |
| Five | 5 | 4,495.50 PLN | 899.10 PLN (-10%) |
| Team | 10 | 7,992.00 PLN | 799.20 PLN (-20%) |
| Team L | 25 | 17,481.25 PLN | 699.25 PLN (-30%) |
| Team XL | 50 | 29,970.00 PLN | 599.40 PLN (-40%) |
| Team MAX | 100 | 49,950.00 PLN | 499.50 PLN (-50%) |
| Unlimited | Unlimited within the organization | Custom pricing | Custom pricing |
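Because the prices listed here are net, it can help to work out the gross amount including 23% VAT. The following Python sketch does that for the listed package prices. The rounding rule (half up, to whole grosze) is an assumption for illustration; the amount on an actual invoice is authoritative.

```python
from decimal import Decimal, ROUND_HALF_UP

VAT_RATE = Decimal("0.23")  # 23% VAT, as stated in the price list


def gross_price(net: Decimal) -> Decimal:
    """Return the gross price in PLN, rounded half up to 0.01 PLN (assumed rule)."""
    gross = net * (1 + VAT_RATE)
    return gross.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


# Net prices copied from the price list on this page.
NET_PRICES = {
    "Single": Decimal("999.00"),
    "Five": Decimal("4495.50"),
    "Team": Decimal("7992.00"),
    "Team L": Decimal("17481.25"),
    "Team XL": Decimal("29970.00"),
    "Team MAX": Decimal("49950.00"),
}

for name, net in NET_PRICES.items():
    print(f"{name}: {net} PLN net, {gross_price(net)} PLN gross")
```

`Decimal` is used instead of `float` so that amounts such as 4,495.50 PLN are represented exactly and rounding behaves predictably.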

Contact us to receive an individual quote tailored to the needs and scale of your organization.

© Mycroft Solutions Sp. z o.o.