Perfume Co-Preference Network
The **Perfume Co-Preference Network** dataset comprises user reviews and ratings collected from the Persian retail platform Atrafshan. Central to our research on community detection in fragrance preferences, it includes 36,434 comments from 7,387 unique users and provides insight into consumer sentiment toward various perfumes. It is designed to support sentiment-based analysis of user preferences, so that perfumes can be clustered by shared attributes.
The dataset features three main components:
- User Reviews and Perfume Attributes Dataset: Captures the sentiments users express in comments, along with metadata such as user IDs, perfume details, and ratings across four key attributes (scent, longevity, sillage, and design).
- Emoji Mapping Dataset: Maps 392 common emojis to their Persian equivalents to improve sentiment analysis accuracy.
- Sentiment Classification Results: Three CSV files containing sentiment classifications biased toward specific perfume attributes: Scent, Longevity, and Sillage. The classifications are derived from user comments using the ParsBERT model and integrate user ratings to give a more nuanced picture of consumer preferences.
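As an illustration of how a co-preference network might be derived from the sentiment classification results, the sketch below links two perfumes whenever the same user is classified as positive toward both, and weights each link by the number of such users. This is an assumed construction, not necessarily the procedure used in our research. The row layout `(user_id, perfume, sentiment)`, the label `"positive"`, and the function name `co_preference_edges` are hypothetical; the actual CSV columns and label values may differ.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical sample rows: (user_id, perfume, sentiment).
# The real CSV files may use different column names and label values.
rows = [
    ("u1", "A", "positive"),
    ("u1", "B", "positive"),
    ("u2", "A", "positive"),
    ("u2", "B", "positive"),
    ("u2", "C", "negative"),
    ("u3", "B", "positive"),
    ("u3", "C", "positive"),
]


def co_preference_edges(rows, liked_label="positive"):
    """Count, for each pair of perfumes, the users who liked both."""
    # Collect the set of perfumes each user is positive about.
    liked = defaultdict(set)
    for user, perfume, sentiment in rows:
        if sentiment == liked_label:
            liked[user].add(perfume)

    # Every pair of perfumes liked by the same user gains one unit of weight.
    weights = defaultdict(int)
    for perfumes in liked.values():
        for a, b in combinations(sorted(perfumes), 2):
            weights[(a, b)] += 1
    return dict(weights)


edges = co_preference_edges(rows)
```

The resulting weighted edge list can be passed to any graph library for community detection. Sorting each user's perfumes before pairing keeps every undirected edge under a single key.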
For access to the dataset and further details, please visit our GitHub repository.
Total number of user comments: 36,434
Total number of unique users: 7,387
Number of emojis in mapping: 392
Number of CSV files with sentiment classifications: 3