This project is part of the Advanced Topics in Machine Learning subject. A more detailed description can be found in the project documentation.
Consider a set of books from 19th Century English Fiction [1].
The data set is built from Project Gutenberg [2]. It consists of about 1000 books spread across roughly 10 genres. The task is to detect the genre [3] of a book, i.e. multi-class classification. Each data point in this classification task is a fiction book with a genre label. Note the following three main challenges tackled: