External References
References to articles
As you will see exploring SparkER, there is a big and heavy theoretical background with everything this library uses. Since it would require a bit bit more than just few lines to explain every definition and action of SparkER, we will give you some references to articles published by developers and professors about the mathematical and algorithmical tools used here.
References:
Simonini, G., Bergamaschi, S., Jagadish, H. V. (2016). BLAST: a Loosely Schema-aware Meta-blocking Approach for Entity Resolution. PVLDB, 9(12), 1173–1184
Papadakis, G., Koutrika, G., Palpanas, T., Nejdl, W. (2014). Meta-blocking: Taking entity resolution to the next level. IEEE TKDE.
Papadakis, G., Papastefanatos, G., Palpanas, T., Koubarakis, M., Green, E. L. (2016). Scaling Entity Resolution to Large , Heterogeneous Data with Enhanced Meta-blocking, 221–232. IEEE TKDE
Papadakis, G., Ioannou, E., Niederée, C., Fankhauser, P. (2011). Efficient entity resolution for large heterogeneous information spaces. Proceedings of the Fourth ACM International Conference on Web Search and Data Mining - WSDM ’11, 535
Gagliardelli, L., Zhu, S., Simonini, G., Bergamaschi, S. (2018). Bigdedup: a Big Data integration toolkit for duplicate detection in industrial scenarios. In 25th International Conference on Transdisciplinary Engineering (TE2018) (Vol. 7, pp. 1015-1023)