Data for: Words are important: A textual content based identity resolution scheme across multiple online social networks

Main Author: Srivastava, Deepesh
Format: Dataset
Published: Mendeley, 2020
Subjects:
Online Access: https://data.mendeley.com/datasets/rcpxp7m3tn
Table of Contents:
  • 1. Training dataset
      1.1. Columns: the columns hold features extracted from the content of the source profile (Twitter) and the target profiles (Facebook). The last column indicates whether the pair is a match or a no-match. The dataset has 31 columns.
      1.2. Rows: each row is a source and target profile pair, labeled as match or no-match. The dataset has 31,882 rows.
  • 2. Test dataset
      2.1. Columns: the columns hold features extracted from the content of the source profile (Twitter) and the target profiles (Facebook). The last column indicates whether the pair is a match or a no-match. The dataset has 31 columns.
      2.2. Rows: each row is a source and target profile pair, labeled as match or no-match. The dataset has 17,392 rows.
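
The layout described above (31 columns, with the match/no-match label in the last one) can be sketched in Python as follows. This is a minimal illustration and not the author's code. It assumes the files are comma-separated text with a header row, which the record does not state. The helper names `split_features_label` and `load_pairs`, the column names, and the label strings `match` and `no-match` are hypothetical. The sample rows are synthetic and not taken from the dataset.

```python
import csv
import io

N_COLUMNS = 31  # column count stated in the dataset description


def split_features_label(rows):
    """Split each row into its features and its label (the last column)."""
    features, labels = [], []
    for row in rows:
        if len(row) != N_COLUMNS:
            raise ValueError(f"expected {N_COLUMNS} columns, got {len(row)}")
        features.append(row[:-1])  # first 30 columns: extracted features
        labels.append(row[-1])     # last column: match / no-match condition
    return features, labels


def load_pairs(fileobj, has_header=True):
    """Read profile pairs from an open CSV file (the CSV format is an assumption)."""
    reader = csv.reader(fileobj)
    if has_header:
        next(reader, None)  # skip the header row if one is present
    return split_features_label(reader)


# Synthetic two-row sample, used only to show the expected shape.
sample = io.StringIO(
    ",".join(f"f{i}" for i in range(30)) + ",label\n"
    + ",".join(["0.5"] * 30) + ",match\n"
    + ",".join(["0.1"] * 30) + ",no-match\n"
)
features, labels = load_pairs(sample)
```

With the real files, the same call would be made on an opened training or test file. Checking the resulting row counts against 31,882 and 17,392 is a quick way to confirm the download is complete.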