Use of the word "P0RN" for display of pics of inst

Board policies and official announcements about Chiff and Fipple and the forums. Please read!
User avatar
Nanohedron
Moderatorer
Posts: 38212
Joined: Wed Dec 18, 2002 6:00 pm
antispam: No
Please enter the next number in sequence: 8
Tell us something.: Been a fluter, citternist, and uilleann piper; committed now to the way of the harp.

Oh, yeah: also a mod here, not a spammer. A matter of opinion, perhaps.
Location: Lefse country

Post by Nanohedron »

Denny wrote:the bots will have the same issues as the lichen filter...
I'm a bit thicker than usual this morning. Could you elucidate?
"If you take music out of this world, you will have nothing but a ball of fire." - Tribal musician
User avatar
Denny
Posts: 24005
Joined: Mon Nov 17, 2003 11:29 am
antispam: No
Location: N of Seattle

Post by Denny »

Honey, you ain't thick, ya just ain't spent no time there. :D

both are software search algorithms that are looking for specific characters or a group of characters (usually with white space on either side, like - blank "p" "o" "r" "n" blank -).

bots would also be looking for .com, .net... a bit cuter is - blank BunchOfStuff "@" BunchOfSruff ".com" blank -

pBB has a bit of software that reads this junk from the box we type it into and formats it for the page (or the preview panel) C&F has added a scan for the string - blank "p" "o" "r" "n" blank - and replaces it with lichen.

notpornstring <--- no blankies
User avatar
Nanohedron
Moderatorer
Posts: 38212
Joined: Wed Dec 18, 2002 6:00 pm
antispam: No
Please enter the next number in sequence: 8
Tell us something.: Been a fluter, citternist, and uilleann piper; committed now to the way of the harp.

Oh, yeah: also a mod here, not a spammer. A matter of opinion, perhaps.
Location: Lefse country

Post by Nanohedron »

Denny wrote:Honey, you ain't thick, ya just ain't spent no time there. :D

both are software search algorithms that are looking for specific characters or a group of characters (usually with white space on either side, like - blank "p" "o" "r" "n" blank -).

bots would also be looking for .com, .net... a bit cuter is - blank BunchOfStuff "@" BunchOfSruff ".com" blank -

pBB has a bit of software that reads this junk from the box we type it into and formats it for the page (or the preview panel) C&F has added a scan for the string - blank "p" "o" "r" "n" blank - and replaces it with lichen.

notpornstring <--- no blankies
:lol: I'm still lost.

I guess what I'm asking is that, in layman's terms, does the bypassing of the filter negate its function? IOW, instead of the intended "lichen", I see the other word. Does the search algorithm benefit, as it were, from this, too?
"If you take music out of this world, you will have nothing but a ball of fire." - Tribal musician
User avatar
Denny
Posts: 24005
Joined: Mon Nov 17, 2003 11:29 am
antispam: No
Location: N of Seattle

Post by Denny »

when in doubt ... test
http://chiffboard.mati.ca/viewtopic.php ... orn#626997
note: the lichen after the highlight= If I edit, and indeed when I pasted it, it is that 4 letter word.

it would appear that the search & replacement thing is run against the search criteria prior to the search being executed.
User avatar
Nanohedron
Moderatorer
Posts: 38212
Joined: Wed Dec 18, 2002 6:00 pm
antispam: No
Please enter the next number in sequence: 8
Tell us something.: Been a fluter, citternist, and uilleann piper; committed now to the way of the harp.

Oh, yeah: also a mod here, not a spammer. A matter of opinion, perhaps.
Location: Lefse country

Post by Nanohedron »

Denny wrote:when in doubt ... test
http://chiffboard.mati.ca/viewtopic.php ... hen#626997
note: the lichen after the highlight= If I edit, and indeed when I pasted it, it is that 4 letter word.

it would appear that the search & replacement thing is run against the search criteria prior to the search being executed.
'Kay. I'm not getting an anwer to my question that I can understand yet, though. Sorry. :cry:
"If you take music out of this world, you will have nothing but a ball of fire." - Tribal musician
User avatar
kkrell
Posts: 4831
Joined: Mon Jul 29, 2002 6:00 pm
antispam: No
Please enter the next number in sequence: 8
Tell us something.: Mostly producer of the Wooden Flute Obsession 3-volume 6-CD 7-hour set of mostly player's choice of Irish tunes, played mostly solo, on mostly wooden flutes by approximately 120 different mostly highly-rated traditional flute players & are mostly...
Location: Los Angeles
Contact:

Post by kkrell »

I believe the point is being made that the search for a bad word (the precursor to 'lichen') looks for very specific instances of the word, and is easily beaten (for instance by substituting the number zero for the letter 'o'), as well as other common measures spammers use to get past email filters. Other services scanning the C&F forums may or may not be more sophisticated in identifying the bad word. So Google, its ads, office bad word filters, etc. might not be as easily fooled. So, C&F might be blocked from certain machines, in offices, libraries, etc. regardless of C&F administrators attempt to keep things a little more family friendly.

Kevin Krell
International Traditional Music Society, Inc.
A non-profit 501c3 charity/educational public benefit corporation
Wooden Flute Obsession CDs (3 volumes, 6 discs, 7 hours, 120 players/tracks)
https://www.worldtrad.org
User avatar
Denny
Posts: 24005
Joined: Mon Nov 17, 2003 11:29 am
antispam: No
Location: N of Seattle

Post by Denny »

The "filter" is only done on the way to the web page.
This is a sub task to the bit that translates the junk in the "Text area" (I's typing in one of them right now)
The tags (bold, size, color, quote, HTML ,etc.) are all parsed and changed to attributes, instead of text pictures are fetched....bunch o'stuff..."filter" is a piece if it.
User avatar
kkrell
Posts: 4831
Joined: Mon Jul 29, 2002 6:00 pm
antispam: No
Please enter the next number in sequence: 8
Tell us something.: Mostly producer of the Wooden Flute Obsession 3-volume 6-CD 7-hour set of mostly player's choice of Irish tunes, played mostly solo, on mostly wooden flutes by approximately 120 different mostly highly-rated traditional flute players & are mostly...
Location: Los Angeles
Contact:

Post by kkrell »

Denny wrote:The "filter" is only done on the way to the web page.
This is a sub task to the bit that translates the junk in the "Text area" (I's typing in one of them right now)
The tags (bold, size, color, quote, HTML ,etc.) are all parsed and changed to attributes, instead of text pictures are fetched....bunch o'stuff..."filter" is a piece if it.
Now I'm dense. Are you saying that the word substitution only occurs on display of the C&F thread to the person viewing it (a run-time conversion as it is viewed)? But the bad content is actually still in the database as it was typed? Thus, other functions that directly access the database of messages will still find the bad word (which might include search functions, 'bots, etc.)?

Kevin Krell
International Traditional Music Society, Inc.
A non-profit 501c3 charity/educational public benefit corporation
Wooden Flute Obsession CDs (3 volumes, 6 discs, 7 hours, 120 players/tracks)
https://www.worldtrad.org
User avatar
Denny
Posts: 24005
Joined: Mon Nov 17, 2003 11:29 am
antispam: No
Location: N of Seattle

Post by Denny »

kkrell wrote:Are you saying that the word substitution only occurs on display of the C&F thread to the person viewing it (a run-time conversion as it is viewed)? But the bad content is actually still in the database as it was typed?
I'm saying that based on how a couple of searches looked to me that the above is true.
kkrell wrote:Thus, other functions that directly access the database of messages will still find the bad word (which might include search functions, 'bots, etc.)?
I do not believe that bots directly access the database. I think that they read the returned HTML, just like the browser.
User avatar
Denny
Posts: 24005
Joined: Mon Nov 17, 2003 11:29 am
antispam: No
Location: N of Seattle

Post by Denny »

lichen

quote this post...
User avatar
rich
i see what you did there
Posts: 609
Joined: Mon May 14, 2001 6:00 pm
Please enter the next number in sequence: 1
Location: Toronto, Ontario
Contact:

Post by rich »

Denny wrote:lichen

quote this post...
(The quote talked about the moss.)

It's stored in the database as the unfiltered word because otherwise there'd be no way to make a word-filter affect previous posts without going through and changing them all when the filter is turned on, and there'd be no way to ever turn a word-filter off.

Bots and search engines don't see the database, they see the same thing you see. The only place I can think of where the unfiltered words appear is in the RSS feed, and that's a "bug" in that I think the author of the RSS feed plugin forgot to apply filtering.
User avatar
Denny
Posts: 24005
Joined: Mon Nov 17, 2003 11:29 am
antispam: No
Location: N of Seattle

Post by Denny »

:D so quote does :D
edit is not filtered

Ew, I forgot about the RSS feed... that's a bit tacky!
User avatar
I.D.10-t
Posts: 7657
Joined: Wed Dec 17, 2003 9:57 am
antispam: No
Location: Minneapolis, MN, USA, Earth

Post by I.D.10-t »

Try doing a Google search on my exact words quoted by Nanohedron.

Google recognizes it, does their search engine work all that differently than a filter? I think that Nanohedron's initial concerns are valid. (Okay, rich answered the question with more authority, but I am a proof of concept kind of person)

Just to clarify, I believe that the automated substitution was put into place not to reduce spam bots, but because filtering software blocks sites that have bad words. As I understand it, my stunt would not allow some people to read page 2.


~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
As a side note I had found the feature when trying to figure out how to explain how to post images (see below). It works to break up a complex post where normal options like "Disable BBCode in this post" just won’t work.

[img]http://chiffboard.mati.ca/images/smiles ... le_144.gif[/img]
"Be not deceived by the sweet words of proverbial philosophy. Sugar of lead is a poison."
Post Reply