Jump to content
MFrost

Polish diactric characters in epg

Recommended Posts

MFrost

Sometimes it happens that Polish diacritical characters are not displayed correctly in EPG.

 

e.g ÂSwiat wedřug Kiepskich  

https://ibb.co/1r3THpz

https://ibb.co/68wDpwp

 

should beŚwiat według Kiepskich

 

Please correct these.

Share this post


Link to post
Griga

Polish providers are famous for wrong or missing character set information in the EPG data.

 

We can't do anything about it without a sample that contains such EPG data, recorded with the TransEdit Analyzer after right-clicking the PID list -> Select Main SI PIDs.

 

Share this post


Link to post
MFrost

Could you describe me step by step how to do it?

Share this post


Link to post
Griga
  1. Download TransEdit from the customer area and store TransEdit.exe in the DVBViewer installation folder (where DVBViewer.exe is).
  2. Launch TransEdit.
  3. On the left side of the TransEdit main window select the transponder list that you are using for a channel search in DVBViewer.
  4. On the right side of the main window select the frequency on which the channel in question (whose EPG contains wrong characters) is broadcasted.
  5. Click Analyze. It lets the Analyzer Window appear.
  6. Wait some seconds until all streams (PIDs) have appeared in the PID list on the right side of the Analyzer Window.
  7. Right-click the PID list and click "Select Main SI PIDs" in the context menu.
  8. Click "Start Recording". Wait  about 60 seconds, click "Stop Recording". Main Window -> Settings -> Analyzer -> Output Directory tells you where the recoding (a .ts file) gets stored.
  9. Upload the file to some web space. Even zipped it may be too large for appending it in the forum.
  10. Post a link here and tell me which channel / which programme I have to look for.

You may also want to examine the EPG details yourself by expanding the EIT (Event Information Table) nodes in the tree view on the left side of the Analyzer Window. More about it here.

 

Share this post


Link to post
Griga

Zażółć gęślą jaźń :)

 

The result of the analysis: Your cable provider does it correctly, but DVBViewer suffers from the work-arounds that we had to implement some years ago for Polish characters.

 

Polish providers basically are using two character sets: Either the East-Eurpoean ISO-8859-2 or the general ISO 6937 for countries with many diacritical characters. According to the DVB specifications ISO-8859-2 must be indicated explicitely in the EPG data. ISO 6937 must not be indicated because it is the default character set.

 

However, some years ago there were Polish providers that used ISO-8859-2 without any indication, so it got misinterpreted as ISO 6937. Or even worse, they used ISO 6937, but declared it as Russian/Cyrillic or Turkish character set. We got many complaints about wrong characters from Polish users (I still have all these samples) and somehow tried to handle the chaos by implementing an ISO 6937 character set recognition. It first assumes ISO-8859-2 and then tries to find characteristics of ISO 6937. It's a kind of guessing that can't resolve all ambiguities, so there are always certain cases where it does the wrong thing. E.g in case of "ÂSwiat wedřug" DVBViewer assumed ISO-8859-2, though the UPC cable provider is correctly using ISO 6937 in your sample throughout.

 

Maybe in the meantime Polish providers have learned to better adhere to international standards. I've checked the situation on Hotbird (satellite) and found that nc+ and Cyfrowy Polsat are now indicating ISO-8859-2 for all EPG data. I don't know what other cable providers than UPC are doing or what is happening via DVB-T, but maybe it's time for a strategy change in DVBViewer: First assume ISO 6937 and then try to find a proof for ISO-8859-2 (the other way round). This will display true ISO 6937 always correctly, but may still fail in certain cases where ISO-8859-2 is used with missing indication.

 

The short result of a long story: I will provide a test version for you that contains the change. Please check your PMs...

 

Share this post


Link to post
MFrost

it's ok now :)

Share this post


Link to post
Griga

Can you also provide a Polish DVB-T sample in the same way as for DVB-C? It is quite useful for development to have samples from broadcasts that we can't receive ourselves.

Share this post


Link to post
MFrost

OK, Give me a moment.

Share this post


Link to post
Griga

Thanks a lot! Unlike your DVB-C provider, DVB-T uses correctly indicated ISO-8859-2 (East European character set), which is no problem for DVBViewer. I'm glad that I don't have to look after it. :)

 

Zwischenablage01.png

 

 

Share this post


Link to post
MFrost

When will you make the official version DVBViewer with corrected characters for my dvb-c UPC?

Share this post


Link to post
Griga

I don't know. Maybe next year...

Share this post


Link to post
MFrost

Oh my God, I understand that this is a joke...

Share this post


Link to post
Griga

I can't reproduce it, and I can see no reason for it. There has been no EPG related change between 6.1.5.2 and 6.1.5.3 except the character set adjustment. It's quite unlikely that it causes such an issue.

 

Please describe the problem in more detail. When does it occur? Right after DVBViewer start or later? Are all EPG tabs / all channels (including DVB-T) affected or not? Did you countercheck with 6.1.5.2 if  the issue is really 6.1.5.3-specific?

 

Share this post


Link to post

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.


×
×
  • Create New...