CPFAQ: Most Viewed Writers

Writer

I’m working on a project this year to build a competitive programming FAQ. This is one in a series of articles describing the research, writing, and tool creation process. To read the whole series, see my CPFAQ category page.

To find questions for the FAQ, I started with a set of relevant Quora topics, and collected the questions under those topics. That gave me a master list of questions. But because of question duplication and other Quora data quality problems, I needed ways to rank the questions in the list so I could focus my efforts on the best ones. One data point I use is follower count. This gives me the questions that the most people are interested in, whether or not they have good answers. This week I’m going to look at the answer upvotes metric, an indicator of answer quality. Since a FAQ has both questions and answers, it’s important to identify good answers as well as good questions.

« Continue »

CPFAQ: Codeforces in Wikipedia

I’m working on a project this year to build a competitive programming FAQ. This is one in a series of articles describing the research, writing, and tool creation process. To read the whole series, see my CPFAQ category page. In defining FAQ-related concepts like question categories, I always check Wikipedia to see what it has […]

Continue

CPFAQ: Question Categories

I’m working on a project this year to build a competitive programming FAQ. This is one in a series of articles describing the research, writing, and tool creation process. To read the whole series, see my CPFAQ category page. As I mentioned last week, I have been writing and categorizing canonical questions for the FAQ. […]

Continue

CPFAQ: Canonical Questions

I’m working on a project this year to build a competitive programming FAQ. This is one in a series of articles describing the research, writing, and tool creation process. To read the whole series, see my CPFAQ category page. In recent weeks, I’ve been using text mining techniques to analyze a set of Quora questions […]

Continue

CPFAQ: Patterns in Question Titles, Part 3

I’m working on a project this year to build a competitive programming FAQ. This is one in a series of articles describing the research, writing, and tool creation process. To read the whole series, see my CPFAQ category page. I’m mining my Quora question corpus to find patterns and collect data to help write a […]

Continue

CPFAQ: Patterns in Question Titles, Part 2

I’m working on a project this year to build a competitive programming FAQ. This is one in a series of articles describing the research, writing, and tool creation process. To read the whole series, see my CPFAQ category page. Last week, I did some simple text mining to classify Quora Competitive Programming questions based on […]

Continue

CPFAQ: Patterns in Question Titles

I’m working on a project this year to build a competitive programming FAQ. This is one in a series of articles describing the research, writing, and tool creation process. To read the whole series, see my CPFAQ category page. Now that I have a database of competitive programming questions, I thought it would be interesting […]

Continue

CPFAQ: SELECT Queries

I’m working on a project this year to build a competitive programming FAQ. This is one in a series of articles describing the research, writing, and tool creation process. To read the whole series, see my CPFAQ category page. Data in a relational database is often not arranged in a way that makes sense to […]

Continue

CPFAQ: A Question Database, Part 2

I’m working on a project this year to build a competitive programming FAQ. This is one in a series of articles describing the research, writing, and tool creation process. To read the whole series, see my CPFAQ category page. Last week, I created the first version of a database schema that will store the source […]

Continue

CPFAQ: A Question Database

I’m working on a project this year to build a competitive programming FAQ. This is one in a series of articles describing the research, writing, and tool creation process. To read the whole series, see my CPFAQ category page. So far this year, I’ve been building tools that operate on text files in tab-separate value […]

Continue