When running the following code snippet on AWS Glue:
results_candidate_pairs = anonlink.candidate_generation.find_candidate_pairs(
[
...
],
[
...
]
anonlink.similarities.dice_coefficient_accelerated,
0.9,
)
I encounter the following error:
ValueError: Buffer dtype mismatch, expected 'const char' but got 'signed char'
I tried using anonlink.similarities.dice_coefficient_accelerated_python as an alternative, and it did not produce the error. However, this alternative is significantly slower, making it impractical for large datasets.