MarcinMikula/README.md

QA Engineer building AI-powered test automation. 13+ years across telco, banking & insurance, now turning that domain depth into autonomous testing tools.

I test billing platforms, CRMs, and self-service portals for a living, and build agents that test (and heal) them for fun. Currently exploring how far LLMs can go in closing the loop between bug found → bug reproduced → test fixed, without a human re-typing a selector ever again.

13+ yrs QA  β†’  Python / Playwright / pytest  β†’  AI-assisted automation  β†’  ???

🔭 What I'm building

| Project | What it does |
| --- | --- |
| 🔥 PhoenixQA | Self-healing test framework: an LLM diagnoses broken selectors (Shadow DOM, dynamic attrs) and repairs them in Safe or Autonomous mode, learning from every decision |
| 🐛 defect-pilot | AI agent that reproduces a Jira bug and generates a Playwright retest script from the ticket alone |
| 🧪 llm-qa-toolkit | LLM-as-judge framework for testing chatbots in regulated industries: hallucination, prompt injection, regression |
| 🏗️ qa-automation-framework | The skeleton underneath it all: Playwright + pytest, POM/SOM, SQLAlchemy, enterprise telco/CRM/billing context |

These aren't isolated demos; they're one ecosystem. qa-automation-framework is the chassis, PhoenixQA keeps its selectors alive, defect-pilot closes bugs against it, and llm-qa-toolkit applies the same AI-QA thinking to a different problem: testing LLMs themselves.


🧠 How I work

  • I only put a technology on my CV once I've actually built something with it. No checkbox-collecting
  • Domain knowledge is the differentiator I lean on: I've debugged billing migrations and credit-risk logic that most automation engineers have never seen
  • AI is a tool I use daily (Claude, Cursor) rather than a buzzword on a slide. Most projects above include an AI-assisted component that can be inspected directly in the code.
  • Pragmatic over theoretical: I'd rather ship a working POC with a LEARNINGS.md than a perfect architecture diagram that never runs

🌱 Currently exploring

  • Self-healing automation strategies: the core problem PhoenixQA is solving
  • LLM evaluation and benchmarking: Safe vs Autonomous healing accuracy
  • Agentic QA workflows: agents that read a bug ticket and act on it (defect-pilot)
  • Local vs cloud LLM trade-offs for regulated/enterprise test data

πŸ› οΈ Stack

Python · Playwright · pytest · SQLAlchemy · Jira API · Ollama · Claude / Anthropic API · GitHub Actions · Allure


πŸ“ Based in Warsaw β€” open to remote/hybrid roles

nofluffjobs profile · ⚽ Fan of AI, Python, and Spanish football

Pinned

  1. qa-automation-framework (Public)

    Enterprise QA automation framework | Playwright + pytest | POM · SOM · SQLAlchemy | Telco/CRM/Billing context

    Python

  2. llm-qa-toolkit (Public)

    A framework for testing LLM-based chatbots in regulated industries (telco, banking, insurance). Covers hallucination detection, prompt injection resistance, response quality scoring and regression …

    Python 1

  3. defect-pilot (Public)

    Software quality starts with test process quality. AI-powered QA gatekeeper for Jira + Salesforce (an example application): enriches bug reports, enforces completeness standards, and automates rete…

    Python

  4. PhoenixQA (Public)

    🔥 Self-healing test automation framework. When a Playwright selector breaks, PhoenixQA diagnoses the failure with an LLM, proposes a fix, and learns from every decision. Local (Ollama) or API (Anthrop…

    Python