Add Homoglyphs detection in Minder #2312

teodor-yanev · 2024-02-08T21:12:37Z

Closes: #2121

Please refer to #2121 for a full description of the work covered in this PR. There will be a second short one for adding the rule types and a profile.

Reference to the PR and repo used for testing the feature: teodor-yanev/a-testrepo#1

A couple of notes here regarding the implementation:

Given the ongoing work in #2171 (Create single status comment and correctly dismiss reviews) and how tightly the "review" calls and string constants are coupled to the "vulncheck" dependency checks for ecosystems, I opted for an "extract and refactor" approach, which can be seen in "reviewer.go" and the related functions and files. Once we reach a consensus on generalising these calls, we can come back to this work and make the necessary changes.
Two rule types instead of one with parameters: this is an entirely subjective decision, but it gives us more flexibility when we want to customise each rule further.
Two evaluators behind one "main" 'Homoglyphs' evaluator: the Homoglyphs evaluator exposes a type field that tells us which functionality is needed, and nothing more. There are two for now, but there might be more in the future. Although both are classified under the same "Evaluator" type, they are separate logical entities that produce different sets of results and make decisions based on different parameters.
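
As a rough illustration of the last note, a type field selecting between two evaluators could be sketched as below. This is not the code in this PR: the `Evaluator` interface, the struct names, and the type strings `"invisible_characters"` and `"mixed_scripts"` are all assumptions made for the example.

```go
package main

import (
	"errors"
	"fmt"
)

// Evaluator is a hypothetical stand-in for the engine's evaluator interface.
type Evaluator interface {
	Name() string
}

// Two separate logical entities behind one "Homoglyphs" entry point.
type invisibleCharactersEvaluator struct{}

func (invisibleCharactersEvaluator) Name() string { return "invisible_characters" }

type mixedScriptsEvaluator struct{}

func (mixedScriptsEvaluator) Name() string { return "mixed_scripts" }

var errUnknownType = errors.New("unknown homoglyphs evaluator type")

// newHomoglyphsEvaluator picks the concrete evaluator from the type field.
// The type strings are assumed values, not taken from the PR.
func newHomoglyphsEvaluator(evalType string) (Evaluator, error) {
	switch evalType {
	case "invisible_characters":
		return invisibleCharactersEvaluator{}, nil
	case "mixed_scripts":
		return mixedScriptsEvaluator{}, nil
	default:
		return nil, fmt.Errorf("%w: %q", errUnknownType, evalType)
	}
}

func main() {
	for _, t := range []string{"mixed_scripts", "something_else"} {
		ev, err := newHomoglyphsEvaluator(t)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Println("selected:", ev.Name())
	}
}
```

Adding a third evaluator in this shape would mean one more struct and one more `case`, which is the flexibility the note is after.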

Please note that the PR size isn't as scary as it looks: the scripts.txt file is ~3k lines alone and we also have proto generations.

JAORMX · 2024-02-08T21:15:24Z

6k PR daaaaaaaaaaamn

internal/engine/ingester/diff/diff.go

dmjb · 2024-02-09T12:33:37Z

internal/engine/eval/homoglyphs/application/mixed_scripts_eval.go

+}
+
+// Eval evaluates the mixed scripts rule type
+func (mse *MixedScriptsEvaluator) Eval(ctx context.Context, _ map[string]any, res *engif.Result) error {


The Eval methods of these two evaluation strategies seem to share mostly identical code. I suspect it's possible to refactor them to share a common function. From what I can see, the only things that change between the two Eval methods are:

The type of processor and the method of the processor which carries out the evaluation.

The message written in the PR comment

The status/message in the PR review

One solution for this would be to add a common interface for MixedScriptsEvaluator and InvisibleCharactersEvaluator which looks something like this:

type HomoglyphEvaluator interface {
    FindViolations(codeLine string) InfoType
    GetLineCommentText() string
    GetFailedReviewText() string
    GetPassedReviewText() string
}

(I am not 100% sure of the types involved here, and there may be better names than the ones in my example)

At which point you could write a common function with a signature such as:

func checkForHomoglyphViolation(ctx context.Context, evaluator HomoglyphEvaluator, res *engif.Result) error

This will mostly look like the current code, except that it delegates to the methods of the HomoglyphEvaluator interface instead of having hard-coded evaluator-specific references. The Eval methods of both structs then simply call into the common function with the appropriate arguments.

Thanks for the descriptive suggestion and the discussion that followed in Slack!
I've addressed your comments.

add: homoglyphs detection

bf0b42b

teodor-yanev requested a review from a team as a code owner February 8, 2024 21:12

teodor-yanev self-assigned this Feb 8, 2024

github-advanced-security bot found potential problems Feb 8, 2024

internal/engine/ingester/diff/diff.go (fixed)

teodor-yanev added 2 commits February 8, 2024 23:49

fix: integer overflow vuln + lint comments

b4e04a3

fix: unused function param

7118628

teodor-yanev mentioned this pull request Feb 9, 2024

Full diff ingestor #2325

Merged

teodor-yanev added 2 commits February 9, 2024 13:14

Merge branch 'main' into add-minder-homoglyphs-detection

54978b2

fix: address more lint checks

6aa81e2

dmjb requested changes Feb 9, 2024

teodor-yanev added 4 commits February 9, 2024 20:45

update: refactoring

17ae5a5

add: license

a796adf

update: address lints

eab1c5a

update: redundant comments and lint

63d05d8

teodor-yanev requested a review from dmjb February 9, 2024 19:14

dmjb approved these changes Feb 11, 2024

teodor-yanev merged commit 067a3b7 into main Feb 11, 2024
19 checks passed

teodor-yanev deleted the add-minder-homoglyphs-detection branch February 11, 2024 19:01
