It is currently Thu Oct 01, 2020 10:53 am

All times are UTC - 6 hours




Post new topic Reply to topic  [ 43 posts ]  Go to page Previous  1, 2, 3
Author Message
 
 Post subject:
PostPosted: Sun May 27, 2007 12:25 pm 
Offline
Moderatorer
User avatar

Joined: Wed Dec 18, 2002 6:00 pm
Posts: 36002
Location: Among the pixels
Denny wrote:
the bots will have the same issues as the lichen filter...


I'm a bit thicker than usual this morning. Could you elucidate?

_________________
"Time is the wisest counselor of all." - Pericles

"I remain not entirely convinced of it." - Nano


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Sun May 27, 2007 1:42 pm 
Offline
User avatar

Joined: Mon Nov 17, 2003 11:29 am
Posts: 24005
Location: N of Seattle
Honey, you ain't thick, ya just ain't spent no time there. :D

both are software search algorithms that are looking for specific characters or a group of characters (usually with white space on either side, like - blank "p" "o" "r" "n" blank -).

bots would also be looking for .com, .net... a bit cuter is - blank BunchOfStuff "@" BunchOfSruff ".com" blank -

pBB has a bit of software that reads this junk from the box we type it into and formats it for the page (or the preview panel) C&F has added a scan for the string - blank "p" "o" "r" "n" blank - and replaces it with lichen.

notpornstring <--- no blankies


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Sun May 27, 2007 2:01 pm 
Offline
Moderatorer
User avatar

Joined: Wed Dec 18, 2002 6:00 pm
Posts: 36002
Location: Among the pixels
Denny wrote:
Honey, you ain't thick, ya just ain't spent no time there. :D

both are software search algorithms that are looking for specific characters or a group of characters (usually with white space on either side, like - blank "p" "o" "r" "n" blank -).

bots would also be looking for .com, .net... a bit cuter is - blank BunchOfStuff "@" BunchOfSruff ".com" blank -

pBB has a bit of software that reads this junk from the box we type it into and formats it for the page (or the preview panel) C&F has added a scan for the string - blank "p" "o" "r" "n" blank - and replaces it with lichen.

notpornstring <--- no blankies


:lol: I'm still lost.

I guess what I'm asking is that, in layman's terms, does the bypassing of the filter negate its function? IOW, instead of the intended "lichen", I see the other word. Does the search algorithm benefit, as it were, from this, too?

_________________
"Time is the wisest counselor of all." - Pericles

"I remain not entirely convinced of it." - Nano


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Sun May 27, 2007 2:13 pm 
Offline
User avatar

Joined: Mon Nov 17, 2003 11:29 am
Posts: 24005
Location: N of Seattle
when in doubt ... test
http://chiffboard.mati.ca/viewtopic.php ... orn#626997
note: the lichen after the highlight= If I edit, and indeed when I pasted it, it is that 4 letter word.

it would appear that the search & replacement thing is run against the search criteria prior to the search being executed.


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Sun May 27, 2007 2:32 pm 
Offline
Moderatorer
User avatar

Joined: Wed Dec 18, 2002 6:00 pm
Posts: 36002
Location: Among the pixels
Denny wrote:
when in doubt ... test
http://chiffboard.mati.ca/viewtopic.php ... hen#626997
note: the lichen after the highlight= If I edit, and indeed when I pasted it, it is that 4 letter word.

it would appear that the search & replacement thing is run against the search criteria prior to the search being executed.


'Kay. I'm not getting an anwer to my question that I can understand yet, though. Sorry. :cry:

_________________
"Time is the wisest counselor of all." - Pericles

"I remain not entirely convinced of it." - Nano


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Sun May 27, 2007 3:10 pm 
Offline
User avatar

Joined: Mon Jul 29, 2002 6:00 pm
Posts: 4305
Location: Los Angeles
I believe the point is being made that the search for a bad word (the precursor to 'lichen') looks for very specific instances of the word, and is easily beaten (for instance by substituting the number zero for the letter 'o'), as well as other common measures spammers use to get past email filters. Other services scanning the C&F forums may or may not be more sophisticated in identifying the bad word. So Google, its ads, office bad word filters, etc. might not be as easily fooled. So, C&F might be blocked from certain machines, in offices, libraries, etc. regardless of C&F administrators attempt to keep things a little more family friendly.

Kevin Krell

_________________
International Traditional Music Society, Inc.
A non-profit 501c3 charity/educational public benefit corporation
Wooden Flute Obsession CDs (3 volumes, 6 discs, 7 hours, 120 players/tracks)
http://www.worldtrad.org


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Sun May 27, 2007 3:12 pm 
Offline
User avatar

Joined: Mon Nov 17, 2003 11:29 am
Posts: 24005
Location: N of Seattle
The "filter" is only done on the way to the web page.
This is a sub task to the bit that translates the junk in the "Text area" (I's typing in one of them right now)
The tags (bold, size, color, quote, HTML ,etc.) are all parsed and changed to attributes, instead of text pictures are fetched....bunch o'stuff..."filter" is a piece if it.


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Sun May 27, 2007 3:20 pm 
Offline
User avatar

Joined: Mon Jul 29, 2002 6:00 pm
Posts: 4305
Location: Los Angeles
Denny wrote:
The "filter" is only done on the way to the web page.
This is a sub task to the bit that translates the junk in the "Text area" (I's typing in one of them right now)
The tags (bold, size, color, quote, HTML ,etc.) are all parsed and changed to attributes, instead of text pictures are fetched....bunch o'stuff..."filter" is a piece if it.


Now I'm dense. Are you saying that the word substitution only occurs on display of the C&F thread to the person viewing it (a run-time conversion as it is viewed)? But the bad content is actually still in the database as it was typed? Thus, other functions that directly access the database of messages will still find the bad word (which might include search functions, 'bots, etc.)?

Kevin Krell

_________________
International Traditional Music Society, Inc.
A non-profit 501c3 charity/educational public benefit corporation
Wooden Flute Obsession CDs (3 volumes, 6 discs, 7 hours, 120 players/tracks)
http://www.worldtrad.org


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Sun May 27, 2007 5:45 pm 
Offline
User avatar

Joined: Mon Nov 17, 2003 11:29 am
Posts: 24005
Location: N of Seattle
kkrell wrote:
Are you saying that the word substitution only occurs on display of the C&F thread to the person viewing it (a run-time conversion as it is viewed)? But the bad content is actually still in the database as it was typed?

I'm saying that based on how a couple of searches looked to me that the above is true.

kkrell wrote:
Thus, other functions that directly access the database of messages will still find the bad word (which might include search functions, 'bots, etc.)?

I do not believe that bots directly access the database. I think that they read the returned HTML, just like the browser.


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Sun May 27, 2007 5:51 pm 
Offline
User avatar

Joined: Mon Nov 17, 2003 11:29 am
Posts: 24005
Location: N of Seattle
lichen

quote this post...


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Sun May 27, 2007 6:55 pm 
Offline
i see what you did there
User avatar

Joined: Mon May 14, 2001 6:00 pm
Posts: 609
Location: Toronto, Ontario
Denny wrote:
lichen

quote this post...


(The quote talked about the moss.)

It's stored in the database as the unfiltered word because otherwise there'd be no way to make a word-filter affect previous posts without going through and changing them all when the filter is turned on, and there'd be no way to ever turn a word-filter off.

Bots and search engines don't see the database, they see the same thing you see. The only place I can think of where the unfiltered words appear is in the RSS feed, and that's a "bug" in that I think the author of the RSS feed plugin forgot to apply filtering.


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Sun May 27, 2007 7:07 pm 
Offline
User avatar

Joined: Mon Nov 17, 2003 11:29 am
Posts: 24005
Location: N of Seattle
:D so quote does :D
edit is not filtered

Ew, I forgot about the RSS feed... that's a bit tacky!


Top
 Profile  
Reply with quote  
 
 Post subject:
PostPosted: Mon May 28, 2007 6:09 am 
Offline
User avatar

Joined: Wed Dec 17, 2003 9:57 am
Posts: 7645
Location: Minneapolis, MN, USA, Earth
Try doing a Google search on my exact words quoted by Nanohedron.

Google recognizes it, does their search engine work all that differently than a filter? I think that Nanohedron's initial concerns are valid. (Okay, rich answered the question with more authority, but I am a proof of concept kind of person)

Just to clarify, I believe that the automated substitution was put into place not to reduce spam bots, but because filtering software blocks sites that have bad words. As I understand it, my stunt would not allow some people to read page 2.


~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
As a side note I had found the feature when trying to figure out how to explain how to post images (see below). It works to break up a complex post where normal options like "Disable BBCode in this post" just won’t work.

[im[b][/b]g]http://chiffboard.mati.ca/images/smiles/icon_smile_144.gif[/img]

_________________
"Be not deceived by the sweet words of proverbial philosophy. Sugar of lead is a poison."


Top
 Profile  
Reply with quote  
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 43 posts ]  Go to page Previous  1, 2, 3

All times are UTC - 6 hours


Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum

Search for:
Jump to:  
Powered by phpBB® Forum Software © phpBB Group
[ Time : 0.128s | 11 Queries | GZIP : On ]
(dh)