CPC G06F 40/284 (2020.01) [G06F 16/9032 (2019.01); G06F 16/90344 (2019.01); G06F 40/289 (2020.01); G06Q 10/1053 (2013.01)] | 12 Claims |
1. A computer-implemented method comprising:
determining, by one or more processors, sentences in a job description; and
for each sentence:
checking, by the one or more processors, for a match of the sentence to one of a plurality of seed phrases, each seed phrase being a sequence of one or more words that is associated with a category of talent-screening questions, wherein checking for the match comprises:
performing a first match operation to determine matches in the sentence to seed phrases with three or more words; and
performing a second match operation to determine the sentences with requirement words that match seed phrases with two or more words;
based on the match, identifying, by the one or more processors, a talent-screening question for the sentence;
adding the second seed phrases to the first seed phrases to obtain third seed phrases, each seed phrase being a sequence of one or more words that is associated with a category of talent-screening questions;
wherein checking for a match comprises a third match operation, the third match operation comprising finding sentences comprising a word appearing in the seed phrases, a second word that is associated with the word appearing in the seed phrases, and a requirement word; and
wherein finding sentences comprising a word appearing in the seed phrases comprises creating an inverted-index word map of the seed phrases by tokenizing the seed phrases into words and associating each word with the other words that appear in the seed phrases, wherein finding sentences comprising the second word associated with the word appearing in the seed phrases is based on the inverted-index word map of the seed phrases.
|