How to strip out strange characters when consuming a feed?

KaRa_MaVi

New Member
I am consuming a couple of feeds at the same time and assembling them into one single feed. When grabbing and 'cleaning up' the description for a particular tag, I find bullet characters that I cannot for the life of me remove from the output.

Doing a simple \[code\]str_replace\[/code\] to find the \[code\]?\[/code\] character (just like that, not an \[code\]li\[/code\] or ASCII value) does nothing at all for me. I'm scratching my head and wondering why this is. This does not seem to be an encoding issue, simply a bullet point being sent over in a non-ASCII-safe format.

Anyone run into this? A character you couldn't identify or remove?

Here is some example text:

\[code\]Required Qualifications:?BSME or equivalent four year degree?Minimum four years in blahblah industry experience\[/code\]

The above is an example of a description I wish to clean up (I would love to replace the bullet with a \[code\]-\[/code\], but would settle for just removing it).

Ideas?

EDIT -------

Based on feedback, here is some additional detail. The character just comes through as is: \[code\]?\[/code\]. I doubt it is an encoding issue, as this particular location outputs this data set either to HTML (a webpage with the details) or to an XML feed (packaged HTML tags inside the description field).

I consume the multiple XML feeds using \[code\]xml2array\[/code\] (PHP). I have not had any issues with it before. I am pretty sure it is \[code\]UTF-8\[/code\]; just the bullet comes through.

To assemble the feeds, I build my own array server side, and once I correlate the proper values from the other feeds, I output the final 'built' XML feed (which I then have an internal app consume).

The reason for consuming multiple sources? Gaps in the data that are not available in one format.

MORE EDITING -------

OK, looks like this is an encoding issue, but I have yet to remove the \[code\]?\[/code\] bullet.

I convert it using \[code\]utf8_encode\[/code\]; however, I get odd symbols that don't copy identically, so I get something like \[code\]a[]
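
For anyone hitting the same thing, here is a minimal sketch of one way to approach it. It is not from the original post and rests on assumptions: that the character really is a bullet, and that it arrives in one of three byte forms (UTF-8 U+2022, the single Windows-1252 byte 0x95, or U+0095, which is what \[code\]utf8_encode\[/code\] produces when it treats 0x95 as ISO-8859-1). The function name \[code\]replace_bullets\[/code\] is hypothetical. Matching a literal \[code\]?\[/code\] with \[code\]str_replace\[/code\] likely fails because the \[code\]?\[/code\] is only how the unknown byte is displayed, so the sketch matches the raw bytes instead.

```php
<?php
// Hypothetical helper (not from the original post): swap bullet bytes for a
// plain replacement such as "-".
// Assumption: the bullet is one of the three byte sequences listed below.
function replace_bullets(string $text, string $replacement = '-'): string
{
    // Multi-byte forms: UTF-8 U+2022 BULLET, and U+0095 (the result of
    // running the Windows-1252 bullet byte through utf8_encode()).
    $text = str_replace(["\xE2\x80\xA2", "\xC2\x95"], $replacement, $text);

    // A lone 0x95 byte is the bullet in Windows-1252, but in valid UTF-8 the
    // same byte can be part of another character. Only replace it when the
    // string is NOT valid UTF-8 (preg_match with /u returns false then).
    if (preg_match('//u', $text) !== 1) {
        $text = str_replace("\x95", $replacement, $text);
    }

    return $text;
}
```

To find out which bytes the feed actually sends, dumping a short slice of the description with \[code\]bin2hex\[/code\] helps; a UTF-8 bullet shows up as \[code\]e280a2\[/code\] and a Windows-1252 bullet as \[code\]95\[/code\]. If the bytes turn out to be something else, the list inside the function would need adjusting.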
 