Now, it's time for me to officially move away from IDA to #Ghidra everything I have. I will start with Magic Strings, then move to port #Diaphora.
Thank you very much to the person that donated 100 euros to the #Diaphora project, it's highly appreciated!
The Reverse Engineering community has spoken. #Diaphora will be ported to #Ghidra in the next months. I would love to have it working properly by the end of the year, but I cannot be sure. So, no ETA for now.
Thank you very much to the person that donated 25 euros to #Diaphora!
Question about IDA and Ghidra considering the latest IDA license changes, regarding #Diaphora: If a (highly integrated) Ghidra port of Diaphora appears "soon", would you...
Only use the IDA version for <= 8.4
7.7%
Only use the IDA version for >= 9.0
7.7%
Only use the Ghidra version
69.2%
Use both IDA and Ghidra, regardless of versions
15.4%
Poll ended at .

Let me explain you the idea and the problem.

The Idea: Put a service for functions signatures using #Diaphora technologies online or a SQLite database file that can be easily downloaded somewhere for offline usage.

The Problem: Github doesn't allow files bigger than 100 MB. The initial signatures database I have is 1,2 GB, but it will grow a bit (I don't think it will be ever over 10 GB in years and years).

That explained, any recommendation about how to solve this problem with my resources?

What is a *cheap* way to put an online service that requires a database that will be under ~10 GB for a number of years? If it's free, better.

#Diaphora

I haven't published a new #Diaphora release just yet because the code is still a bit experimental and there are some problems that I'm trying to fix like, for example: SciKit Learn distributed models cannot be used if the version of the training machine and the machine where it's being used (where Diaphora runs) are different.

The code is also published (in github) already and #Diaphora now can use an already trained model to try to improve binary diffing results (matching). I haven't made yet a new release just yet as these changes are considered a bit experimental for now.

The datasets and tools for training and testing are here: https://github.com/joxeankoret/diaphora-ml
And Diaphora, is here: https://github.com/joxeankoret/diaphora

#Diaphora #BinaryDiffing #Bindiffing #ReverseEngineering #MachineLearning

GitHub - joxeankoret/diaphora-ml: Diaphora Machine Learning tools and datasets

Diaphora Machine Learning tools and datasets. Contribute to joxeankoret/diaphora-ml development by creating an account on GitHub.

GitHub

Here are the slides of my "Simple Machine Learning Techniques for Binary Diffing (with Diaphora)" talk given at the @44CON conference last week:

https://github.com/joxeankoret/diaphora-ml/blob/main/docs/diaphora-ml-techniques-44con-final.pdf

#44con #Diaphora #MachineLearning #ReverseEngineering #BinaryDiffing

diaphora-ml/docs/diaphora-ml-techniques-44con-final.pdf at main · joxeankoret/diaphora-ml

Diaphora Machine Learning tools and datasets. Contribute to joxeankoret/diaphora-ml development by creating an account on GitHub.

GitHub