ArchiveOrangemail archive

Language Tag Registry Update working group discussion list


ltru.ietf.org
(List home) (Recent threads) (187 other Internet Engineering Task Force (IETF) lists)

Subscription Options

  • RSS or Atom: Read-only subscription using a browser or aggregator. This is the recommended way if you don't need to send messages to the list. You can learn more about feed syndication and clients here.
  • Conventional: All messages are delivered to your mail address, and you can reply. To subscribe, send an email to the list's subscribe address with "subscribe" in the subject line, or visit the list's homepage here.
  • Low traffic list: less than 3 messages per day
  • This list contains about 191 messages, beginning Jun 2011
  • 0 messages added yesterday
Report the Spam
This button sends a spam report to the moderator. Please use it sparingly. For other removal requests, read this.
Are you sure? yes no

Re: [Ltru] Proposed -t0- subtag

Ad
Doug Ewell 1311526298Sun, 24 Jul 2011 16:51:38 +0000 (UTC)
"Broome, Karen" <Karen dot Broome at am dot sony dot com> wrote:

> Forgive me if I'm only able to skim these e-mails, but I'm wondering 
> if there might be a parallel here with a use case like: English 
> language film gets translated into Spoken Japanese (dubbed) for one 
> market/format and Written Japanese (subtitles/captions/AD) for 
> another.Testing my understanding of the proposed extension, I think both would 
be "ja-t-en".  The second could alternatively be "ja-Jpan-t-en", but 
since 'Jpan' is the Suppress-Script for 'ja', it could normally be 
omitted unless one felt it necessary to call attention to the writing 
system.

Folks who insist that spoken content needs to be tagged as such, not 
inferred from the context, would probably say the first tag should be 
"jp-Zxxx-t-en".

The point is that the same Japanese content is converted along two 
discrete paths; there is no indication that one is converted from the 
other.  The source in each case is English ("-t-en").--
Doug Ewell | Thornton, Colorado, USA | RFC 5645, 4645, UTN #14
www.ewellic.org | www.facebook.com/doug.ewell | @DougEwell ­
Broome, Karen 1311527049Sun, 24 Jul 2011 17:04:09 +0000 (UTC)
Hmm. Need to give this some thought and read the draft a little more closely. 

Long ago, I thought it might be appropriate for SMPTE or a similar body to register a singleton and use it to indicate subtitled/written or dubbed/spoken language, but seeing this addition I'm thinking it through again. I think the usage is likely more general than just the motion picture industry. "Zxxx" (code for unwritten documents) is not something I'll likely recommend any time soon for the motion picture industry. I've always opposed that for use to describe spoken language "tracks" though I know my use cases awhile back may have supported the creation of this tag. In most cases, it is true that the language mode can be inferred, but this is not always the case.

Regards,

Karen Broome

-----Original Message-----
From:  [mailto:] On Behalf Of Doug Ewell
Sent: Sunday, July 24, 2011 9:52 AM
To: 
Subject: Re: [Ltru] Proposed -t0- subtag"Broome, Karen" <Karen dot Broome at am dot sony dot com> wrote:

> Forgive me if I'm only able to skim these e-mails, but I'm wondering 
> if there might be a parallel here with a use case like: English 
> language film gets translated into Spoken Japanese (dubbed) for one 
> market/format and Written Japanese (subtitles/captions/AD) for 
> another.Testing my understanding of the proposed extension, I think both would 
be "ja-t-en".  The second could alternatively be "ja-Jpan-t-en", but 
since 'Jpan' is the Suppress-Script for 'ja', it could normally be 
omitted unless one felt it necessary to call attention to the writing 
system.

Folks who insist that spoken content needs to be tagged as such, not 
inferred from the context, would probably say the first tag should be 
"jp-Zxxx-t-en".

The point is that the same Japanese content is converted along two 
discrete paths; there is no indication that one is converted from the 
other.  The source in each case is English ("-t-en").

--
Doug Ewell | Thornton, Colorado, USA | RFC 5645, 4645, UTN #14
www.ewellic.org | www.facebook.com/doug.ewell | @DougEwell ­ 

_______________________________________________
Ltru mailing list

https://www.ietf.org/mailman/listinfo/ltru
Mark Davis ☕ 1311527948Sun, 24 Jul 2011 17:19:08 +0000 (UTC)
We are using Zxxx for spoken-only content, and haven't had any problems with
that.

Mark
*— Il meglio è l’inimico del bene —*On Sun, Jul 24, 2011 at 10:01, Broome, Karen wrote:

> Hmm. Need to give this some thought and read the draft a little more
> closely.
>
> Long ago, I thought it might be appropriate for SMPTE or a similar body to
> register a singleton and use it to indicate subtitled/written or
> dubbed/spoken language, but seeing this addition I'm thinking it through
> again. I think the usage is likely more general than just the motion picture
> industry. "Zxxx" (code for unwritten documents) is not something I'll likely
> recommend any time soon for the motion picture industry. I've always opposed
> that for use to describe spoken language "tracks" though I know my use cases
> awhile back may have supported the creation of this tag. In most cases, it
> is true that the language mode can be inferred, but this is not always the
> case.
>
> Regards,
>
> Karen Broome
>
> -----Original Message-----
> From:  [mailto:] On Behalf Of
> Doug Ewell
> Sent: Sunday, July 24, 2011 9:52 AM
> To: 
> Subject: Re: [Ltru] Proposed -t0- subtag
>
> "Broome, Karen" <Karen dot Broome at am dot sony dot com> wrote:
>
> > Forgive me if I'm only able to skim these e-mails, but I'm wondering
> > if there might be a parallel here with a use case like: English
> > language film gets translated into Spoken Japanese (dubbed) for one
> > market/format and Written Japanese (subtitles/captions/AD) for
> > another.
>
> Testing my understanding of the proposed extension, I think both would
> be "ja-t-en".  The second could alternatively be "ja-Jpan-t-en", but
> since 'Jpan' is the Suppress-Script for 'ja', it could normally be
> omitted unless one felt it necessary to call attention to the writing
> system.
>
> Folks who insist that spoken content needs to be tagged as such, not
> inferred from the context, would probably say the first tag should be
> "jp-Zxxx-t-en".
>
> The point is that the same Japanese content is converted along two
> discrete paths; there is no indication that one is converted from the
> other.  The source in each case is English ("-t-en").
>
> --
> Doug Ewell | Thornton, Colorado, USA | RFC 5645, 4645, UTN #14
> www.ewellic.org | www.facebook.com/doug.ewell | @DougEwell ­
>
> _______________________________________________
> Ltru mailing list
> 
> https://www.ietf.org/mailman/listinfo/ltru
> _______________________________________________
> Ltru mailing list
> 
> https://www.ietf.org/mailman/listinfo/ltru
>
Broome, Karen 1311535460Sun, 24 Jul 2011 19:24:20 +0000 (UTC)
Do you regularly have the “same” content in multiple audio and text versions and need to make that distinction? I guess why I wouldn’t recommend it to the motion picture community is that it’s a very text-centric way of looking at it. It may “work” but I’m guessing if I knew how you used the tag, I’d feel you were misrepresenting the tag’s semantics, which were always fuzzy and have changed over time.  Is your usage based on the tag’s original semantics or its meaning today? Its definition now seems like a bit of an oxymoron to me, but I don’t take issue if others find it useful. I just think the motion picture industry should likely avoid it as I suspect its use in the wild is fairly inconsistent.

Regards,

Karen Broome

From:  [mailto:] On Behalf Of Mark Davis ?
Sent: Sunday, July 24, 2011 10:19 AM
To: Broome, Karen
Cc: Doug Ewell; 
Subject: Re: [Ltru] Proposed -t0- subtag

We are using Zxxx for spoken-only content, and haven't had any problems with that.

Mark
— Il meglio è l’inimico del bene —

On Sun, Jul 24, 2011 at 10:01, Broome, Karen <mailto:> wrote:
Hmm. Need to give this some thought and read the draft a little more closely.

Long ago, I thought it might be appropriate for SMPTE or a similar body to register a singleton and use it to indicate subtitled/written or dubbed/spoken language, but seeing this addition I'm thinking it through again. I think the usage is likely more general than just the motion picture industry. "Zxxx" (code for unwritten documents) is not something I'll likely recommend any time soon for the motion picture industry. I've always opposed that for use to describe spoken language "tracks" though I know my use cases awhile back may have supported the creation of this tag. In most cases, it is true that the language mode can be inferred, but this is not always the case.

Regards,

Karen Broome

-----Original Message-----
From: <mailto: [mailto:<mailto:] On Behalf Of Doug Ewell
Sent: Sunday, July 24, 2011 9:52 AM
To: <mailto:Subject: Re: [Ltru] Proposed -t0- subtag
"Broome, Karen" <Karen dot Broome at am dot sony dot com> wrote:

> Forgive me if I'm only able to skim these e-mails, but I'm wondering
> if there might be a parallel here with a use case like: English
> language film gets translated into Spoken Japanese (dubbed) for one
> market/format and Written Japanese (subtitles/captions/AD) for
> another.Testing my understanding of the proposed extension, I think both would
be "ja-t-en".  The second could alternatively be "ja-Jpan-t-en", but
since 'Jpan' is the Suppress-Script for 'ja', it could normally be
omitted unless one felt it necessary to call attention to the writing
system.

Folks who insist that spoken content needs to be tagged as such, not
inferred from the context, would probably say the first tag should be
"jp-Zxxx-t-en".

The point is that the same Japanese content is converted along two
discrete paths; there is no indication that one is converted from the
other.  The source in each case is English ("-t-en").

--
Doug Ewell | Thornton, Colorado, USA | RFC 5645, 4645, UTN #14
www.ewellic.org<http://www.ewellic.org> | www.facebook.com/doug.ewell<http://www.facebook.com/doug.ewell> | @DougEwell ­

_______________________________________________
Ltru mailing list
<mailto:
https://www.ietf.org/mailman/listinfo/ltru
_______________________________________________
Ltru mailing list
<mailto:
https://www.ietf.org/mailman/listinfo/ltru
Mark Davis ☕ 1311545434Sun, 24 Jul 2011 22:10:34 +0000 (UTC)
From BCP47:

       *  When labeling content that is unwritten (such as a recording
          of human speech), the script subtag should not be used, even
          if the language is customarily written in several scripts.
          Thus, the subtitles to a movie might use the tag "uz-Arab"
          (Uzbek, Arabic script), but the audio track for the same
          language would be tagged simply "uz".  (The tag "uz-Zxxx"
          could also be used where content is not written, as the subtag
          'Zxxx' represents the "Code for unwritten documents".)


Mark
*— Il meglio è l’inimico del bene —*On Sun, Jul 24, 2011 at 12:13, Broome, Karen wrote:

> Do you regularly have the “same” content in multiple audio and text
> versions and need to make that distinction? I guess why I wouldn’t recommend
> it to the motion picture community is that it’s a very text-centric way of
> looking at it. It may “work” but I’m guessing if I knew how you used the
> tag, I’d feel you were misrepresenting the tag’s semantics, which were
> always fuzzy and have changed over time.  Is your usage based on the tag’s
> original semantics or its meaning today? Its definition now seems like a bit
> of an oxymoron to me, but I don’t take issue if others find it useful. I
> just think the motion picture industry should likely avoid it as I suspect
> its use in the wild is fairly inconsistent.****
>
> ** **
>
> Regards,****
>
> ** **
>
> Karen Broome****
>
> ** **
>
> *From:*  [mailto:] *On
> Behalf Of *Mark Davis ?
> *Sent:* Sunday, July 24, 2011 10:19 AM
> *To:* Broome, Karen
> *Cc:* Doug Ewell; 
>
> *Subject:* Re: [Ltru] Proposed -t0- subtag****
>
> ** **
>
> We are using Zxxx for spoken-only content, and haven't had any problems
> with that.****
>
> ** **
>
> Mark****
>
> *— Il meglio è l’inimico del bene —*
>
> ****
>
> On Sun, Jul 24, 2011 at 10:01, Broome, Karen 
> wrote:****
>
> Hmm. Need to give this some thought and read the draft a little more
> closely.
>
> Long ago, I thought it might be appropriate for SMPTE or a similar body to
> register a singleton and use it to indicate subtitled/written or
> dubbed/spoken language, but seeing this addition I'm thinking it through
> again. I think the usage is likely more general than just the motion picture
> industry. "Zxxx" (code for unwritten documents) is not something I'll likely
> recommend any time soon for the motion picture industry. I've always opposed
> that for use to describe spoken language "tracks" though I know my use cases
> awhile back may have supported the creation of this tag. In most cases, it
> is true that the language mode can be inferred, but this is not always the
> case.
>
> Regards,
>
> Karen Broome****
>
>
> -----Original Message-----
> From:  [mailto:] On Behalf Of
> Doug Ewell
> Sent: Sunday, July 24, 2011 9:52 AM
> To: 
> Subject: Re: [Ltru] Proposed -t0- subtag****
>
> "Broome, Karen" <Karen dot Broome at am dot sony dot com> wrote:
>
> > Forgive me if I'm only able to skim these e-mails, but I'm wondering
> > if there might be a parallel here with a use case like: English
> > language film gets translated into Spoken Japanese (dubbed) for one
> > market/format and Written Japanese (subtitles/captions/AD) for
> > another.
>
> Testing my understanding of the proposed extension, I think both would
> be "ja-t-en".  The second could alternatively be "ja-Jpan-t-en", but
> since 'Jpan' is the Suppress-Script for 'ja', it could normally be
> omitted unless one felt it necessary to call attention to the writing
> system.
>
> Folks who insist that spoken content needs to be tagged as such, not
> inferred from the context, would probably say the first tag should be
> "jp-Zxxx-t-en".
>
> The point is that the same Japanese content is converted along two
> discrete paths; there is no indication that one is converted from the
> other.  The source in each case is English ("-t-en").
>
> --
> Doug Ewell | Thornton, Colorado, USA | RFC 5645, 4645, UTN #14
> www.ewellic.org | www.facebook.com/doug.ewell | @DougEwell ­
>
> _______________________________________________
> Ltru mailing list
> 
> https://www.ietf.org/mailman/listinfo/ltru
> _______________________________________________
> Ltru mailing list
> 
> https://www.ietf.org/mailman/listinfo/ltru***...
>
> ** **
>
> _______________________________________________
> Ltru mailing list
> 
> https://www.ietf.org/mailman/listinfo/ltru
>
>
Broome, Karen 1311570753Mon, 25 Jul 2011 05:12:33 +0000 (UTC)
Sorry, I was thinking the original ISO registration had gone through as “No Linguistic content” but now I’m remembering that this was somewhat of a partner tag for “zxx” which does mean that. I wasn’t referencing the RFC text per se, which gives the user an option – and I’m OK with the option. Still, I generally find this muddy. I don’t want to look to a script tag to define the language mode because there are commonly used language modes that are not written forms. The disconnect with the “zxx” tag this tags mirrors may be another reason to avoid use to indicate the language mode.

I am conscious of avoiding the interspecies use cases once raised on this list, but the motion picture industry’s use cases might indicate a need for something slightly more specific in the area of language mode. Not sure if it fits in with this work, but I thought I’d mention it.

Karen

From:  [mailto:] On Behalf Of Mark Davis ?
Sent: Sunday, July 24, 2011 3:11 PM
To: Broome, Karen
Cc: ; Doug Ewell
Subject: Re: [Ltru] Proposed -t0- subtag

From BCP47:

       *  When labeling content that is unwritten (such as a recording

          of human speech), the script subtag should not be used, even

          if the language is customarily written in several scripts.

          Thus, the subtitles to a movie might use the tag "uz-Arab"

          (Uzbek, Arabic script), but the audio track for the same

          language would be tagged simply "uz".  (The tag "uz-Zxxx"

          could also be used where content is not written, as the subtag

          'Zxxx' represents the "Code for unwritten documents".)

Mark
— Il meglio è l’inimico del bene —

On Sun, Jul 24, 2011 at 12:13, Broome, Karen <mailto:> wrote:
Do you regularly have the “same” content in multiple audio and text versions and need to make that distinction? I guess why I wouldn’t recommend it to the motion picture community is that it’s a very text-centric way of looking at it. It may “work” but I’m guessing if I knew how you used the tag, I’d feel you were misrepresenting the tag’s semantics, which were always fuzzy and have changed over time.  Is your usage based on the tag’s original semantics or its meaning today? Its definition now seems like a bit of an oxymoron to me, but I don’t take issue if others find it useful. I just think the motion picture industry should likely avoid it as I suspect its use in the wild is fairly inconsistent.

Regards,

Karen Broome

From: <mailto: [mailto:<mailto:] On Behalf Of Mark Davis ?
Sent: Sunday, July 24, 2011 10:19 AM
To: Broome, Karen
Cc: Doug Ewell; <mailto:

Subject: Re: [Ltru] Proposed -t0- subtag

We are using Zxxx for spoken-only content, and haven't had any problems with that.

Mark
— Il meglio è l’inimico del bene —
On Sun, Jul 24, 2011 at 10:01, Broome, Karen <mailto:> wrote:
Hmm. Need to give this some thought and read the draft a little more closely.

Long ago, I thought it might be appropriate for SMPTE or a similar body to register a singleton and use it to indicate subtitled/written or dubbed/spoken language, but seeing this addition I'm thinking it through again. I think the usage is likely more general than just the motion picture industry. "Zxxx" (code for unwritten documents) is not something I'll likely recommend any time soon for the motion picture industry. I've always opposed that for use to describe spoken language "tracks" though I know my use cases awhile back may have supported the creation of this tag. In most cases, it is true that the language mode can be inferred, but this is not always the case.

Regards,

Karen Broome

-----Original Message-----
From: <mailto: [mailto:<mailto:] On Behalf Of Doug Ewell
Sent: Sunday, July 24, 2011 9:52 AM
To: <mailto:Subject: Re: [Ltru] Proposed -t0- subtag
"Broome, Karen" <Karen dot Broome at am dot sony dot com> wrote:

> Forgive me if I'm only able to skim these e-mails, but I'm wondering
> if there might be a parallel here with a use case like: English
> language film gets translated into Spoken Japanese (dubbed) for one
> market/format and Written Japanese (subtitles/captions/AD) for
> another.Testing my understanding of the proposed extension, I think both would
be "ja-t-en".  The second could alternatively be "ja-Jpan-t-en", but
since 'Jpan' is the Suppress-Script for 'ja', it could normally be
omitted unless one felt it necessary to call attention to the writing
system.

Folks who insist that spoken content needs to be tagged as such, not
inferred from the context, would probably say the first tag should be
"jp-Zxxx-t-en".

The point is that the same Japanese content is converted along two
discrete paths; there is no indication that one is converted from the
other.  The source in each case is English ("-t-en").

--
Doug Ewell | Thornton, Colorado, USA | RFC 5645, 4645, UTN #14
www.ewellic.org<http://www.ewellic.org> | www.facebook.com/doug.ewell<http://www.facebook.com/doug.ewell> | @DougEwell ­

_______________________________________________
Ltru mailing list
<mailto:
https://www.ietf.org/mailman/listinfo/ltru
_______________________________________________
Ltru mailing list
<mailto:
https://www.ietf.org/mailman/listinfo/ltru


_______________________________________________
Ltru mailing list
<mailto:
https://www.ietf.org/mailman/listinfo/ltru
Home | About | Privacy