Searching in text fields

Searching in text fields

emma78
I'm trying to find out whether it is possible to find those records with meaningless text in open-ended questions.
I know that I have to look through those open questions manually afterwards, but it would be a great help to mark those respondents who have only hacked at the keyboard without making sense.
Is it possible to search through the string variables and mark those with a lot of consonants, for example?


Many thanks for your help in advance!

Re: Searching in text fields

Art Kendall
I'll leave it to people with more Python expertise to make further suggestions.

However, keep in mind that many people today use texting style, so I would be leery of
a rule of thumb about too many consonants.

2 mch txtng mks u a bad splr

Art Kendall
Social Research Consultants

Re: Searching in text fields

Mario Giesel
In reply to this post by emma78
Emma, you have to define what 'meaningless' means.
I doubt the "too many consonants" rule is worth the effort, but there might be a way with the help of Python and regular expressions.
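
A minimal sketch of what such a regular-expression check might look like in Python, assuming English-like text. The function name `looks_like_mashing`, the run length of five consonants, and the 20% vowel threshold are all hypothetical choices made for illustration, not something proposed in this thread.

```python
import re

# Five or more consecutive consonants ("y" is treated as a vowel here).
# The run length of 5 is an arbitrary assumption.
CONSONANT_RUN = re.compile(r"[bcdfghjklmnpqrstvwxz]{5,}", re.IGNORECASE)
VOWELS = set("aeiouy")


def looks_like_mashing(text, min_vowel_share=0.2):
    """Return True if text has a long consonant run or very few vowels."""
    letters = [ch for ch in text.lower() if ch.isalpha()]
    if not letters:
        return False  # empty or non-alphabetic answers are left for manual review
    if CONSONANT_RUN.search(text):
        return True
    vowel_share = sum(ch in VOWELS for ch in letters) / len(letters)
    return vowel_share < min_vowel_share


for answer in ["I liked the service", "sdfgkjhsdf", "2 mch txtng mks u a bad splr"]:
    print(answer, "->", looks_like_mashing(answer))
```

With these particular thresholds the texting-style sentence is flagged (because of "txtng"), while a string such as "eizudoddh" is not, since it contains enough vowels. That suggests a consonant rule alone is at best a rough first filter.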

GL,
  Mario



emma78 <[hidden email]> wrote at 17:21 on Monday, 15 June 2015:


I'm trying to find out whether it is possible to find those records with
meaningless text in open-ended questions.
I know that I have to look through those open questions manually afterwards,
but it would be a great help to mark those respondents who have only hacked
at the keyboard without making sense.
Is it possible to search through the string variables and mark those with a
lot of consonants, for example?


Many thanks for your help in advance!



--
View this message in context: http://spssx-discussion.1045642.n5.nabble.com/Searching-in-text-fields-tp5729832.html
Sent from the SPSSX Discussion mailing list archive at Nabble.com.

=====================
To manage your subscription to SPSSX-L, send a message to
[hidden email] (not to SPSSX-L), with no body text except the
command. To leave the list, send the command
SIGNOFF SPSSX-L
For a list of commands to manage subscriptions, send the command
INFO REFCARD


Mario Giesel
Munich, Germany

Re: Searching in text fields

emma78
Thank you!
What I'm searching for is something like this: eizudoddh
A meaningless sequence of characters. I know that can be difficult...

Re: Searching in text fields

Jon K Peck
One way to tackle this would be to find an open source spelling dictionary, such as this one (http://svnweb.freebsd.org/base/head/share/dict/web2?view=co), and use it with a little Python code to count the number of misspellings in the text. Statistics includes a spelling checker that you can use interactively in the Data Editor, but there is no API exposed to use it programmatically.

You could do this with a few lines of code using SPSSINC TRANS.
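
A rough sketch of the dictionary idea in plain Python, assuming a one-word-per-line file such as the word list linked above. The names `load_words` and `unknown_share` are hypothetical, and the tiny inline word set only stands in for a real dictionary so the example runs without a file. The scoring function takes one string and returns one number, on the assumption that this shape is convenient for applying it case by case.

```python
import re

WORD_RE = re.compile(r"[a-z']+")


def load_words(path):
    """Read a one-word-per-line dictionary file into a lowercase set."""
    with open(path, encoding="utf-8") as fh:
        return {line.strip().lower() for line in fh if line.strip()}


def unknown_share(text, known_words):
    """Fraction of words in text that are not in known_words (0.0 if no words)."""
    words = WORD_RE.findall(text.lower())
    if not words:
        return 0.0
    missing = sum(1 for w in words if w not in known_words)
    return missing / len(words)


# Tiny stand-in word list; in practice this would be load_words("web2")
# or whatever dictionary file is available (the file name is an assumption).
demo_words = {"the", "service", "was", "good"}
print(unknown_share("The service was good", demo_words))
print(unknown_share("eizudoddh", demo_words))
```

A high share of unknown words would mark a response for manual review rather than decide anything on its own, since names, jargon, and ordinary typos will also miss the dictionary.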


Jon Peck (no "h") aka Kim
Senior Software Engineer, IBM
[hidden email]
phone: 720-342-5621




From:        emma78 <[hidden email]>
To:        [hidden email]
Date:        06/15/2015 10:47 AM
Subject:        Re: [SPSSX-L] Searching in text fields
Sent by:        "SPSSX(r) Discussion" <[hidden email]>




Thank you!
What I'm searching for is something like this: eizudoddh
A meaningless sequence of characters. I know that can be difficult...




Re: Searching in text fields

Albert-Jan Roskam-2
In reply to this post by emma78
Maybe NLTK or PyEnchant: http://stackoverflow.com/questions/3788870/how-to-check-if-a-word-is-an-english-word-with-python
The Project Gutenberg texts might also be useful, e.g. http://www.nltk.org/book/ch02.html
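
For illustration only, a small sketch of how a PyEnchant-based check might be wired up. It assumes the third-party `pyenchant` package and an `en_US` dictionary are installed, and falls back to `None` when they are not. The helper names `english_checker` and `flag_gibberish` and the 0.5 threshold are hypothetical.

```python
import re


def english_checker():
    """Return a word -> bool function backed by PyEnchant, or None if unavailable."""
    try:
        import enchant  # third-party package: pyenchant
        return enchant.Dict("en_US").check
    except Exception:
        # PyEnchant or its en_US dictionary is not installed.
        return None


def flag_gibberish(text, check, threshold=0.5):
    """True if more than `threshold` of the words fail the spelling check."""
    words = re.findall(r"[A-Za-z']+", text)
    if not words:
        return False
    failed = sum(1 for w in words if not check(w))
    return failed / len(words) > threshold


check = english_checker()
if check is not None:
    print(flag_gibberish("eizudoddh", check))
```

Because `flag_gibberish` accepts any word-checking function, the same code could be pointed at a different dictionary or language without changes.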

Regards,

Albert-Jan




~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

All right, but apart from the sanitation, the medicine, education, wine, public order, irrigation, roads, a

fresh water system, and public health, what have the Romans ever done for us?

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~




----- Original Message -----

> From: emma78 <[hidden email]>
> To: [hidden email]
> Cc:
> Sent: Monday, June 15, 2015 6:47 PM
> Subject: Re: [SPSSX-L] Searching in text fields
>
> Thank you!
> What I'm searching for is something like this: eizudoddh
> A meaningless sequence of characters. I know that can be difficult...

Re: Searching in text fields

emma78
Thank you