It is of great value to answer product questions based on heterogeneous information sources available on web product pages, e.g., semistructured attributes, text descriptions, user-provided contents, etc. However, these sources have different structures and writing styles, which poses challenges for (1) evidence ranking, (2) source selection, and (3) answer generation. In this paper, we build a benchmark