Ask About C3 W2 Content based Filtering Data #51
Replies: 2 comments
-
After pulling the repo and running the content-based filtering code, I noticed something odd with the data files—especially Let me break down the files as I’ve come to understand them. Content Item and User Files Overview
Why Duplications Exist in
|
Beta Was this translation helpful? Give feedback.
-
It actually truth cuz when i try to implement the unique chunk it, give out alot higher loss value. But do you have the code that gen those file, or understand the pattern off the generating process of those file. Since some chunk duplicate 131 times, some only 16 times, and the |
Beta Was this translation helpful? Give feedback.
-
Hey guys, im new to the AI department, I try to implement the code from the content based filtering which use neural network on my app. I pull the repo and run the code successfully but i find it hard to understand the data, all those
content_item_train.csv
,content_item_train.csv
andcontent_user_train.csv
.when i look in it the
content_item_train.csv
and thecontent_user_train.csv
. have the same 58187 row (for the model.fit) but why it duplicate a lot like for this chunkthis chunk is one hot vector for movie id 6874 but it got duplicated like 131 times in the
content_item_train.csv
the
content_item_vecs.csv
is just the train without duplicated and thecontent_user_train.csv
not duplicate chunk but in each chunk every rows is the same.Hope you guys can answer it soon.
Beta Was this translation helpful? Give feedback.
All reactions