I'm trying to find out whether it is possible to identify the cases with meaningless text in open-ended questions.
I know that I will have to look through those open-ended answers manually afterwards, but it would be a great help to flag the respondents who only hacked at the keyboard without making sense. Is it possible to search through the string variables and flag those with a lot of consonants, for example? Many thanks in advance for your help! |
I'll leave it to people with more Python expertise to make further suggestions.
However, keep in mind that many people today use texting style, so I would be leery of a rule of thumb about too many consonants. 2 mch txtng mks u a bad splr
Art Kendall
Social Research Consultants |
In reply to this post by emma78
Emma, you have to define what 'meaningless' means. I doubt the "too many consonants" rule is worth the effort, but there might be a way with the help of Python and regular expressions.
GL, Mario

Original message: emma78 <[hidden email]>, Monday, 15 June 2015, 17:21
View this message in context: http://spssx-discussion.1045642.n5.nabble.com/Searching-in-text-fields-tp5729832.html
Sent from the SPSSX Discussion mailing list archive at Nabble.com.
=====================
To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command.
To leave the list, send the command SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command INFO REFCARD
Mario Giesel
Munich, Germany |
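For what it's worth, the regular-expression idea discussed above could look roughly like the sketch below. Everything in it is an assumption made for illustration: the function names, the "five consonants in a row" rule and the 0.2 vowel-ratio cut-off are arbitrary and would need tuning on real answers. It only looks at ASCII letters and counts y as a vowel.

```python
import re

# Assumption: a run of five or more consonants is treated as suspicious.
# Real words such as "strengths" also contain such a run, so expect false positives.
CONSONANT_RUN = re.compile(r"[bcdfghjklmnpqrstvwxz]{5,}", re.IGNORECASE)
LETTERS = re.compile(r"[a-z]", re.IGNORECASE)
VOWELS = re.compile(r"[aeiouy]", re.IGNORECASE)


def vowel_ratio(text):
    """Share of vowels among the ASCII letters in text (0.0 if there are no letters)."""
    letters = LETTERS.findall(text)
    if not letters:
        return 0.0
    return len(VOWELS.findall(text)) / len(letters)


def looks_like_mashing(text, min_vowel_ratio=0.2):
    """Flag text with a long consonant run or very few vowels (hypothetical rule)."""
    if not LETTERS.search(text):
        return False  # empty or digits only: leave that to other checks
    return bool(CONSONANT_RUN.search(text)) or vowel_ratio(text) < min_vowel_ratio


for answer in ["sdfghjkl", "I liked the service", "eizudoddh",
               "2 mch txtng mks u a bad splr"]:
    print(answer, looks_like_mashing(answer))
```

The last two test strings show the weakness raised in this thread: the texting-style sentence gets flagged although it is meaningful, and a string like "eizudoddh" has enough vowels to slip through. A flag like this is at best a first filter before the manual review.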
Thank you!
What I'm searching for is something like this: eizudoddh
A meaningless sequence of characters. I know that can be difficult... |
One way to tackle this would be to find an open source spelling dictionary such as this one
http://svnweb.freebsd.org/base/head/share/dict/web2?view=co
and use it with a little Python code to count the number of misspellings in the text. Statistics includes a spelling checker that you can use interactively in the Data Editor, but there isn't an API exposed to use it programmatically.
You could do this with a few lines of code using SPSSINC TRANS.
Jon Peck (no "h") aka Kim
Senior Software Engineer, IBM
[hidden email]
phone: 720-342-5621 |
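A minimal sketch of the word-list approach described above, assuming Python 3 and a plain-text dictionary file with one word per line (the web2 file linked above has that layout). The function names and the 0.5 threshold are made up for this example. The share of unknown words is used rather than a raw count because a word list of this kind may not contain inflected forms, names or abbreviations.

```python
import re

# Runs of letters, including non-ASCII letters such as umlauts.
# Contractions like "don't" are split at the apostrophe.
TOKEN = re.compile(r"[^\W\d_]+")


def load_words(path):
    """Read a one-word-per-line dictionary file into a lower-cased set."""
    with open(path, encoding="utf-8") as handle:
        return {line.strip().lower() for line in handle if line.strip()}


def unknown_share(text, known_words):
    """Fraction of the words in text that are not in known_words."""
    tokens = [token.lower() for token in TOKEN.findall(text)]
    if not tokens:
        return 0.0
    unknown = sum(1 for token in tokens if token not in known_words)
    return unknown / len(tokens)


def flag_response(text, known_words, threshold=0.5):
    """Return 1 if at least `threshold` of the words are unknown, else 0."""
    return 1 if unknown_share(text, known_words) >= threshold else 0


# Tiny in-memory word list for demonstration; in practice use load_words(path).
known = {"the", "service", "was", "good"}
print(flag_response("The service was good", known))
print(flag_response("eizudoddh", known))
```

A function of this shape is the kind of thing SPSSINC TRANS can apply to a string variable case by case; the exact command syntax is not shown here and should be taken from the extension's own help.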
In reply to this post by emma78
Maybe NLTK or PyEnchant: http://stackoverflow.com/questions/3788870/how-to-check-if-a-word-is-an-english-word-with-python
The Gutenberg project might also be useful, e.g. http://www.nltk.org/book/ch02.html
Regards,
Albert-Jan
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
All right, but apart from the sanitation, the medicine, education, wine, public order, irrigation, roads, a fresh water system, and public health, what have the Romans ever done for us?
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ |
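If PyEnchant is installed, the dictionary lookup can be delegated to it. The sketch below is an illustration only: it assumes an "en_US" dictionary is available to Enchant and falls back to a small hand-made word set otherwise. The helper names make_word_check and count_unrecognised are invented for this example; enchant.Dict and its check method are the PyEnchant calls used.

```python
import re

try:
    import enchant  # PyEnchant, an optional third-party package
except ImportError:
    enchant = None


def make_word_check(fallback_words, lang="en_US"):
    """Return a function word -> bool, backed by PyEnchant when it is usable."""
    if enchant is not None:
        try:
            return enchant.Dict(lang).check
        except Exception:
            pass  # e.g. no dictionary installed for this language
    known = {word.lower() for word in fallback_words}
    return lambda word: word.lower() in known


def count_unrecognised(text, word_check):
    """Count the words in text that word_check does not accept."""
    words = re.findall(r"[^\W\d_]+", text)
    return sum(1 for word in words if not word_check(word))


check = make_word_check(["thank", "you", "very", "much"])
print(count_unrecognised("thank you very much", check))
print(count_unrecognised("eizudoddh", check))
```

The count per answer could then be stored as a new variable and used to sort the cases for manual review.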