Date : Thu, 22 Sep 2011 08:04:21 +1000
From : devries.bob@... (Bob Devries)
Subject: Text conversion?
Darren,
I wonder if the reason for the leading space is that somewhere in its history
the file was converted from CRLF to CR, and instead of removing the LF, it
was replaced with a space?
I've come across this problem in other 8-bit computers I'm involved with.
Regards, Bob Devries
Dalby, QLD, Australia
----- Original Message -----
From: Darren Grant
To: BBC MailList
Sent: Thursday, September 22, 2011 7:35 AM
Subject: Re: [BBC-Micro] Text conversion?
On 21 Sep 2011, at 22:17, Rob wrote:
For the most part, the Beeb /IS/ ASCII ... The main exception being
in MODE 7,
If it helps, this is the code I use as a quick translate of those to
HTML......
$longtext = str_replace(array("#","_","[","]","{","\\","}","~","`"),
array("£","#","«","»","¼","½","¾","÷","-")
, $longtext);
What I am trying to do is convert all of the domesday articles into XML
files, that I can then use lucene to search them.
Seems there are only two differences in modes 0-6 according to the wiki
article http://beebwiki.mdfs.net/ASCII
The other odd thing about these files is every line starts with a space.
Darren
_______________________________________________
bbc-micro mailing list
bbc-micro@...
http://lists.cloud9.co.uk/mailman/listinfo/bbc-micro