🧠 PythonUnpackLLM

AI-Powered Python Bytecode Reverse Engineering Framework

PythonUnpackLLM is an automated reverse-engineering pipeline that reconstructs Python source code from compiled bytecode inside packaged executables.

It combines static bytecode disassembly with local LLM-assisted source reconstruction, designed specifically for:

Malware analysis
Red-team research
Incident response
Python packer forensics

Unlike experimental "LLM decompilers", PythonUnpackLLM focuses on stability, scale, and real-world RE workflows.

Why This Tool Exists

Reverse-engineering Python executables traditionally requires:

Extracting .pyc files
Disassembling bytecode
Manually reasoning about logic
Dealing with latest Python version

This is slow and error-prone.

PythonUnpackLLM automates the full pipeline, using AI only at the interpretation stage — while keeping extraction and disassembly fully deterministic.

The LLM is treated as an untrusted analysis component, not a source of truth.

Pipeline Overview

Executable unpacking (PyInstaller detection + extraction)
Recursive .pyc recovery
Native bytecode disassembly (no AI / extra dependencies)
Function boundary reconstruction
LLM-assisted logic reconstruction
Validation + structured output

Usage

Extract PYC from exe

python PythonUnpackLLM.py --path ./target.exe --unpack

Disassemble a single file

python PythonUnpackLLM.py --path file.pyc --asm

Decompile a single file

python PythonUnpackLLM.py --path file.pyc

Decompile entire extracted tree

python PythonUnpackLLM.py --path ./PYZ.pyz_extracted --type folder

Key Features

Detects packaging type with auto-aborts unsupported formats (saves time in RE workflows)
Built-in PyInstaller Extraction (Integrated pyinstxtractor-ng runner)
Recursive Folder Mode
Reconstructs functions from bytecode disassembly
LLM output is treated as untrusted input. This makes the tool stable even when the model fails.

Use Cases

Malware analysis
Red team tool reversing
IR investigations

Tool Comparison

Capability	PythonUnpackLLM	uncompyle6	decompyle3	pycdc	pyinstxtractor-ng	ByteCodeLLM (original concept)
Purpose	Full automated RE pipeline	Python decompiler	Python decompiler	C++ Python decompiler	PyInstaller extractor	AI-assisted bytecode reasoning
Works on EXE directly	✅ Yes (auto-unpack)	❌ No	❌ No	❌ No	⚠ Extract only	❌ No
PyInstaller extraction	✅ Built-in	❌	❌	❌	✅ Yes	❌
Recursive folder processing	✅ Yes	❌	❌	❌	❌	❌
Handles large sample sets	✅ Designed for scale	⚠ Manual workflow	⚠ Manual workflow	⚠ Manual workflow	❌ Extraction only	❌ Research prototype
Uses AI reconstruction	✅ Local LLM	❌	❌	❌	❌	✅ Yes
Deterministic bytecode analysis	✅ Yes	✅	✅	✅	❌	⚠ Partial
Trust model for AI output	✅ Treated as untrusted	N/A	N/A	N/A	N/A	❌ Not isolated
Function boundary reconstruction	✅ Yes	⚠ Partial	⚠ Partial	⚠ Partial	❌	⚠ Experimental
Crash-safe pipeline	✅ Yes	❌	❌	❌	❌	❌
Works on obfuscated malware samples	✅ Designed for it	⚠ Often fails	⚠ Often fails	⚠ Often fails	❌	⚠ Experimental
Parallel processing	✅ Yes	❌	❌	❌	❌	❌
Output is structured for analysis	✅ Yes	❌ Raw code	❌ Raw code	❌ Raw code	❌	❌

Traditional Python reverse engineering tools focus only on decompilation.
PythonUnpackLLM focuses on end-to-end automation, combining deterministic bytecode analysis with AI-assisted interpretation - while maintaining reliability required for large-scale reverse engineering workflows.

Credits & Acknowledgements

Original Research by CyberArk introducing the original ByteCodeLLM concept
pyinstxtractor-ng project for PyInstaller extraction

Disclaimer

This software is provided "as is", without warranty of any kind. This tool is intended for research, defensive security, and reverse-engineering education. Do not analyze software without legal authorization. The author assumes no responsibility for misuse.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
bytecode_handlers		bytecode_handlers
file_extractors		file_extractors
utilities		utilities
PythonUnpackLLM.py		PythonUnpackLLM.py
README.md		README.md
config.py		config.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 PythonUnpackLLM

AI-Powered Python Bytecode Reverse Engineering Framework

Why This Tool Exists

Pipeline Overview

Usage

Extract PYC from exe

Disassemble a single file

Decompile a single file

Decompile entire extracted tree

Key Features

Use Cases

Tool Comparison

Credits & Acknowledgements

Disclaimer

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🧠 PythonUnpackLLM

AI-Powered Python Bytecode Reverse Engineering Framework

Why This Tool Exists

Pipeline Overview

Usage

Extract PYC from exe

Disassemble a single file

Decompile a single file

Decompile entire extracted tree

Key Features

Use Cases

Tool Comparison

Credits & Acknowledgements

Disclaimer

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages