Replace A0 ascii character with A20 ascii character

This topic has 13 replies, 7 voices, and was last updated 10 years, 6 months ago by Russ Ross.

Creator

Topic
January 5, 2011 at 8:12 pm #52188
Kirby Knight
Participant
I am using the following proc to try an replace the hex A0 space char with the hex 20 space char.

It seemed to work but The issue is that other characters changed as well. For ex. The
Creator

Topic

Viewing 12 reply threads

Author

Replies
- January 5, 2011 at 9:18 pm #73336
  Jim Kosloskey
  Participant
  Kirby,
  
  Just a guess here but I think you mean this
  
  msgset $mh $newmsg
  
  return “{CONTINUE $mh}” ;# run
  
  Instead of
  
  msgget $mh $newmsg return
  
  “{CONTINUE $mh}” ;# run
  
  email: jim.kosloskey@jim-kosloskey.com 30+ years Cloverleaf, 60 years IT – old fart.
- January 5, 2011 at 10:51 pm #73337
  Charlie Bursell
  Participant
  What Jim says is true. Also the statement:
  
  regsub -all — “xA0” $msgdata “x20” newmsg
  
  Since the fields are ib quotes instead of braces they will be interpreted by Tcl as well as the regsub command and can give weird results. If using the regsub I would do it like:
  
  regsub -all — {xA0} $msgdata “x20” newmsg
  
  However in this case I would not use a regsub at al I would use a string map command
  
  set newmsg [string map
- July 1, 2015 at 5:50 pm #73338
  Jo Ellen Laansma
  Participant
  I have invented my own code, tried supplied code, and cannot get what you are suggesting to work.
  
  In my message, there is an embedded xd7 in OBX-5.
  
  I’ve tried:
- July 1, 2015 at 10:28 pm #73339
  David Barr
  Participant
  ~~Jo Ellen Laansma wrote:~~
  
  x7f to xff to see if there was something I wasn’t seeing.
  
  But that doesn’t come close to covering the range of non-ascii characters. In TCL they go up to at least 0xffff.
  
  Why don’t you try printing out the values of each character in your string? TCL may be doing some kind of strange conversion of the message before it gets to your script.
  
  You could add something like this to your TCL proc.
  
  Code: foreach c [split $msg {}] { scan $c %c dec echo $c $dec }
- July 2, 2015 at 12:33 pm #73340
  Jo Ellen Laansma
  Participant
  I’m a little confused by your statement. The largest value in one signed byte is FF. And a hex editor shows D7.
  
  But I did what you suggested and the mystery character came up with 65533 which is FFFD .
  
  I guess that proves the theory that what it’s reading in is not what I thought and I agree with your statement that the tcl must be somehow interfering with the read. But it still leaves me unable to change the value within the tcl preproc. (I’ve tried xffxfd and xfffd).
  
  I think I’m relegated to capturing the bad messages in a file and use a unix command to replace the original d7 problem (which does work), although cumbersome.
  
  Thank you for your comments. If you figure out (based on the message I uploaded) how to change the character, I would still be very interested in the result.
- July 2, 2015 at 12:38 pm #73341
  Jo Ellen Laansma
  Participant
  By gum, using a Unicode translation fixed it!
  
  I regsub’d ufffd instead of x.
- July 2, 2015 at 1:59 pm #73342
  Jo Ellen Laansma
  Participant
  And then, after successfully testing, I try to implement it and it will not work. Such frustration.
  
  I thought maybe it was the “state” it was in when I resent it, so I made sure the inbound thread received it as it normally would by creating extra threads and sending the message down a pipeline.
  
  Another post said that the tester needed \ or {} to indicate it was a control character and the implemenation did not. I’ve tried both ways and neither work. I gave up passing as a variable, afraid that was impacting the esacape sequences and it still does not work.
- July 2, 2015 at 2:24 pm #73343
  Jo Ellen Laansma
  Participant
  So I thought maybe now that it’s going through the engine, it interprets the characters differently.
  
  …
  
  Yes, it’s back to the original d7.
- July 2, 2015 at 4:07 pm #73344
  David Barr
  Participant
  ~~Jo Ellen Laansma wrote:~~
  
  I’m a little confused by your statement.
- July 2, 2015 at 4:55 pm #73345
  Michael Hertel
  Participant
  If you’re trying to convert only nulls why not use msgget?
  
  set msgdata [msgget -cvtnull x20 $mh]
  
  From help contents:
  
  msgget
  
  Retrieves message data from the region specified by offset and length.
  
  msgget ?-cvtnull char? msgId ?offset? ?length?
  
  Nulls in the retrieved data are converted to char if you use -cvtnull. If the message contains nulls and you do not use the -cvtnull option, you cannot access any message data past the first null in the message from Tcl.
  
  In addition, if you store the string returned in this non-conversion case, only the Tcl-visible string portion is stored.
  
  This command returns the retrieved, converted data
- July 2, 2015 at 7:01 pm #73346
  Jo Ellen Laansma
  Participant
  Lastly… I found that when you “resend” messages it had the unicode character ufffd and when they came from a thread in the traditional way it used xd7.
  
  In case anyone wasnted to know there is a difference between resending and thread throughput.
- July 2, 2015 at 10:05 pm #73347
  David Barr
  Participant
  I tried setting up a tcp-ip inbound thread, a file outbound thread to a /dev/null file and a raw static route between them. I put a proc on the route to print the message contents (and the hex code for non-printable characters). Here’s the code I used:
  
  Code: keylget args MSGID mh set msg [msgget $mh] set msg2 “” foreach ch [split $msg {}] { scan $ch %c dec if { $dec >= 32 && $dec <= 126 } { append msg2 $ch } else { append msg2 [format "” $dec] } } echo msg = $msg2 lappend dispList “CONTINUE $mh”
  
  I sent your message to the file using the TCP tester like this:
  
  Code: hcitcptest -h localhost -p 42772 -t mlp < badlabNL.txt
  
  I used a resend command to send the message like this:
  
  Code: hcicmd -p helloworld -c “win1252_in resend ib_post_tps data 5120 $HCIROOT/test_data/badlabNL.txt nl”
  
  I used the testing tool to run my proc like this:
  
  Code: hcitpstest -r run -x windows-1252 -f nl -c sms_ib_data -e “hcitpstestshowbydisp ” /quovadx/cis5.8/integrator/test_data/badlabNL.txt “tps_win1252”
  
  In all of these cases the multiplication character was in my string as D7.
  
  Code: msg = MSH|^~&|PATHNET|GHS|GEMS|GHS|20150624124246||ORU^R03|Q309464539T306492554|P|2.3PIDPV1ORCOBROBX|||||883056 883134 88302
  
  The only way that I was able to replicate your problem was to use the testing tool and change the encoding to ASCII like this:
  
  Code: hcitpstest -r run -x ASCII -f nl -c sms_ib_data -e “hcitpstestshowbydisp ” /quovadx/cis5.8/integrator/test_data/badlabNL.txt “tps_win1252”
  
  This is the output:
  
  Code: msg = MSH|^~&|PATHNET|GHS|GEMS|GHS|20150624124246||ORU^R03|Q309464539T306492554|P|2.3PIDPV1ORCOBROBX|||||883056 883134 88302
  
  The Unicode FFFD (�) character is actually a special character called the “replacement character”. According to Wikipedia “It is used to indicate problems when a system is not able to render a stream of data to a correct symbol.” It makes sense that I would see this if I picked an encoding that doesn’t match the data.
  
  I was not able to replicate the FFFD problem using the resend command.
- July 3, 2015 at 2:10 pm #73348
  Russ Ross
  Participant
  I too have pulled my hair out when dealing with what I will refer to as funky characters.
  
  I have discovered the tester and a real interface will not produce the same outcome when dealing with funky characters.
  
  In some cases even the real interface is stubborn when dealing with funky characters.
  
  Sometimes this is complicated by advanced encoding schemes and UTF-8 that adds to the struggle, so I don’t envy you.
  
  So what I’ve said so far, sort of summarizes what I see already talked about in this post, so congrats on still clinging to the mountain side.
  
  Keep up the faith because seeing it work in hcitcl tells you it works but just have to get the agrs passed without all the extra unwanted intelligence corrupting it.
  
  Now that you are aware of all these pitfalls but still stuck, I can share one more approach I resorted to in an extreme case to get a hard-coded funky character passed as an argument, when all else failed to work.
  
  I created a cloverleaf lookup table and placed the funky character in the lookup table (not the x0… but the actual character using copy/paste then double check lookup table funky character on back end with hcihd), then accessing the lookup table read the funky character into an argument variable.
  
  I know this is a high level overview of the trick I’m describing but is the best I can do at this time with my work load.
  
  Russ Ross
  RussRoss318@gmail.com
Author

Replies

Viewing 12 reply threads

The forum ‘Tcl Library’ is closed to new topics and replies.