What Is a Conditional Generative Adversarial Network?

The rise of Generative Artificial Intelligence (GenAI) has introduced innovative services and cutting-edge tools to automate tasks, optimize processes, and speed up transactions. These benefits make it more enticing for businesses to deploy AI services for their expansion and growth strategies.

One important technological breakthrough that has made this growth possible is the conditional generative adversarial network (CGAN).

5 Papers on Product Classification Every Data Scientist Should Read

Product categorization, also called product classification, is the organization of products into their respective departments or categories. A large part of the process is the design of the product taxonomy as a whole.

Product categorization was initially a text classification task that analyzed the product’s title to choose the appropriate category. However, numerous methods have since been developed that take into account the product title, description, images, and other available metadata.

The following papers on product categorization represent essential reading in the field and offer novel approaches to product classification tasks.

1. Don’t Classify, Translate

In this paper, researchers from the National University of Singapore and the Rakuten Institute of Technology propose and explain a novel machine translation approach to product categorization. The experiment uses the Rakuten Data Challenge and Rakuten Ichiba datasets. 

Their method translates or converts a product’s description into a sequence of tokens which represent a root-to-leaf path to the correct category. Using this method, they are also able to propose meaningful new paths in the taxonomy.
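To make the idea of a root-to-leaf path as a token sequence concrete, here is a minimal sketch. It is not the authors' implementation: the toy taxonomy, the " > " separator, and the helper names `path_to_tokens` and `is_known_path` are all assumptions made for illustration. It only shows how a category path can be written as the target sequence a translation model would emit, and how an emitted sequence that is not in the existing taxonomy can be flagged as a candidate new path.

```python
# Hypothetical toy taxonomy: every leaf category is identified by its
# root-to-leaf path. These paths are illustrative, not from the paper.
KNOWN_PATHS = {
    ("Electronics", "Audio", "Headphones"),
    ("Electronics", "Cameras", "Lenses"),
}


def path_to_tokens(path, sep=">"):
    """Split a category path string into one token per taxonomy level."""
    return [part.strip() for part in path.split(sep)]


def is_known_path(tokens, known_paths=KNOWN_PATHS):
    """Return True if the token sequence is an existing root-to-leaf path."""
    return tuple(tokens) in known_paths


# Target sequence a translation-style model would be trained to produce
# for a product description such as "wireless over-ear headphones".
target = path_to_tokens("Electronics > Audio > Headphones")

# A sequence the model might generate that is not in the taxonomy;
# it can be surfaced as a proposed new path instead of being discarded.
generated = ["Electronics", "Audio", "Lenses"]
proposed_new_path = not is_known_path(generated)
```

Because the output is a sequence rather than a single class label, nothing restricts the model to paths that already exist, which is what allows new paths to be proposed.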

The researchers state that their method outperforms many of the existing classification algorithms commonly used in machine learning today.
  • Published/Last Updated – Dec. 14, 2018
  • Authors and Contributors – Maggie Yundi Li (National University of Singapore), Stanley Kok (National University of Singapore), and Liling Tan (Rakuten Institute of Technology)

2. Large-Scale Categorization of Japanese Product Titles Using Neural Attention Models

The authors of this paper propose attention convolutional neural network (ACNN) models as an alternative to baseline convolutional neural network (CNN) models and gradient boosted tree (GBT) classifiers.

The study uses Japanese product titles taken from Rakuten Ichiba as training data. Using this data, the authors compare the performance of the three methods (ACNN, CNN, and GBT) for large-scale product categorization. 

While differences in accuracy between the methods can be less than 5%, even minor improvements in accuracy can result in millions of additional correct categorizations at this scale.

Lastly, the authors explain how an ensemble of ACNN and GBT models can further minimize false categorizations.
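As a rough illustration of how two classifiers can be combined, the sketch below averages the per-category probabilities of two models and picks the highest combined score. This is an assumption for illustration only: the summary above does not say how the paper builds its ensemble, and the function name `ensemble_predict`, the `weight_a` parameter, and the example probabilities are all hypothetical.

```python
def ensemble_predict(probs_a, probs_b, weight_a=0.5):
    """Combine two models' category probabilities by weighted averaging.

    probs_a and probs_b map category labels to probabilities. A label
    missing from one model is treated as probability 0.0 for that model.
    """
    combined = {}
    for label in set(probs_a) | set(probs_b):
        combined[label] = (
            weight_a * probs_a.get(label, 0.0)
            + (1.0 - weight_a) * probs_b.get(label, 0.0)
        )
    # The predicted category is the one with the highest combined score.
    best_label = max(combined, key=combined.get)
    return best_label, combined


# Hypothetical outputs for one product title from the two model types.
acnn_probs = {"shoes": 0.6, "bags": 0.4}
gbt_probs = {"shoes": 0.3, "bags": 0.7}

label, scores = ensemble_predict(acnn_probs, gbt_probs)
```

With equal weights, the models disagree here and the combined score favors "bags", since the second model is more confident than the first. Averaging of this kind helps when the two models make different mistakes.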


  • Published/Last Updated – April 2017, for EACL 2017
  • Authors and Contributors – From the Rakuten Institute of Technology: Yandi Xia, Aaron Levine, Pradipto Das, Giuseppe Di Fabbrizio, Keiji Shinzato, and Ankur Datta

3. Atlas: A Dataset and Benchmark for Ecommerce Clothing Product Classification

Researchers at the University of Colorado and Ericsson Research (Chennai, India) have created a large product dataset known as Atlas. In this paper, the team presents its dataset, which includes over 186,000 images of clothing products along with their product titles.


An Introduction to 5 Types of Image Annotation

Looking for information on the different image annotation types? In the world of AI and machine learning, data is king. Without data, there can be no data science. For AI developers and researchers to achieve the ambitious goals of their projects, they need access to enormous amounts of high-quality data. With regard to image data, one major field of machine learning that requires large amounts of annotated images is computer vision.

Machine Learning YouTube Videos for Data Scientists (Under 10 min.)

Machine learning educational content is often in the form of academic papers or blog articles. These resources are incredibly valuable. However, they can sometimes be lengthy and time-consuming. If you just want to learn basic concepts and don’t require all the math and theory behind them, concise machine learning videos may be a better option. The YouTube videos on this list cover concepts such as what machine learning is, the basics of natural language processing, how computer vision works, and machine learning in video games.

1. What Is Machine Learning? | Machine Learning Basics


10 Best Text Annotation Tools and Services for Machine Learning

From search engines and sentiment analysis to virtual assistants and chatbots, there are numerous areas of research within machine learning that require text annotation tools and services.

In the AI research and development industries, annotated data is gold, and large quantities of high-quality annotated data are a goldmine. However, finding or creating this data can be an expensive and arduous task for your team. Fortunately, there are a variety of text annotation tools and services available that can provide you with the data you need. Some of these services include entity extraction, part-of-speech tagging, sentiment analysis, and more.