CPFAQ: Merging Questions, Part 2

I’m working on a project this year to build a competitive programming FAQ. This is one in a series of articles describing the research, writing, and tool creation process. To read the whole series, see my CPFAQ category page.

One result of classifying many Quora questions is finding many duplicates. Quora knows about this and provides a Merge function. But as I have written about before, there’s also a content review bot that unmerges questions it thinks are not similar enough. I did some more investigation into this bot’s behavior, which I’ll describe this week.

CPFAQ: Question Categories, Part 2

Using my QuoraClassifier tool, I’ve gotten about 25% of the way through my question list. So […]

CPFAQ: Fast Classification, Part 2

I’m writing a tool called QuoraClassifier to speed up the process of organizing the list of […]

CPFAQ: Fast Classification

Over the past few months, I’ve been classifying a list of Quora questions. Each question gets […]

CPFAQ: Scraping with Selenium

When you’re logged in to Quora, you see more information than an anonymous user does. For […]

CPFAQ: Canonical Question Statistics, Part 2

I have now classified over 1000 Quora questions, using 552 canonical titles, and I think that’s […]

CPFAQ: Good Answers to Bad Questions

As I mentioned at the end of last week’s post, it’s hard to write a good […]

CPFAQ: The Value of Canonical Questions

Last week I discussed how question merging works for Quora and CPFAQ. Related to question merging […]

CPFAQ: Merging Questions

If a CPFAQ page has a canonical title and contains a list of Quora questions that […]

CPFAQ: Canonical Question Statistics

As I mentioned last week, I’m currently creating FAQ pages, and those FAQ pages rely on […]
