Discover, manage, and protect personal data precisely, locally, and securely – with the help of AI running in your environment, without the need for the cloud.
Our solutions successfully support both small local offices and law firms, as well as large financial institutions, medical facilities, and organizations with complex IT infrastructures.
We support compliance with GDPR and the Public Information Access Act. We automate the anonymization of documents before publication in the Public Information Bulletin (BIP) and help institutions manage scattered data collections.
We facilitate the safe processing and editing of documents containing personal data – from case files and reports to correspondence and publication of rulings. We protect both personal data and the institution's reputation.
We provide effective tools for locating, monitoring, and anonymizing customer data. We help meet regulatory requirements, reduce operational risk, and effectively respond to personal data incidents.
We automate the anonymization of medical documentation shared for research, analysis, or inter-unit consultations. Our solutions support GDPR compliance while minimizing the risk of patient data confidentiality breaches.
Employee computers, network resources, cloud storage, email, websites, or individual files – we scan all locations where personal data may be present.
We identify personal data and automatically classify document types – including contracts, invoices, CVs, and many others.
We enable deletion or relocation of files containing personal data from unauthorized locations and their automatic anonymization – fast, precise, and in line with your organization’s security policy.
The entire analysis and processing happens within your infrastructure. No data is sent outside – you retain full control.
The Mycroft system consists of independent, specialized tools that you can deploy separately or combine into one integrated package. Each module addresses specific organizational needs in locating, anonymizing, and protecting personal data.
Searches computers, cloud drives, and email for files containing personal data. Enables rapid threat detection, risk assessment, and compliance with the minimization principle and GDPR requirements.
A tool for automatic document anonymization – including scans and image-only PDFs. It detects personal data, allows quick removal, and the entire process runs locally without cloud involvement.
A specialized tool for public administration. Automatically analyzes the content of Public Information Bulletins (BIP) and detects unintentional disclosures of personal data before they result in GDPR violations.
Monitors public websites, repositories, and other online sources for unintended personal data disclosures. Helps organizations quickly detect privacy breaches and mitigate their impact.
All tools can operate as standalone applications or be integrated into the client’s systems – including as a local API or library.
If you’re developing your own solution that requires advanced personal data detection – we provide our technology as a software library.
You can integrate our library with your system – desktop apps, server applications, or batch processing.
We support integration in an on-premise model or as an internal service running in your environment – without sending data outside the organization.
We offer exactly the same technology that powers Mycroft Sweeper and Guard – including AI models and OCR mechanisms.
Our technology can become a key component of your document workflow system, portal, records repository, or data-sharing platform.
Our engine classifies documents based on their content, structure, and layout. It can automatically distinguish common formats (e.g., invoices, contracts, administrative decisions, court filings, CVs, forms) and learn to recognize custom document types specific to a given organization – aligned with its structure and internal workflows.
We extract from documents data such as names, addresses, account numbers, identifiers (e.g., PESEL, NIP), case numbers, and dates – everything necessary for process automation, document classification, or preliminary indexing.
Our components can be embedded in your system as local libraries or made available through internal APIs – ensuring full control over data processing.