At the moment, the lack of dataset, model weights, and key processing details makes it difficult for the community to validate the claims presented in the project.
Could you clarify whether the results shown are based on a fully implemented system, or are they conceptual / experimental demonstrations?
Providing additional details would greatly improve the credibility and reproducibility of this work.