Wikipedia talk:Overcategorization/Archive 9#Non-defining vs. trivial clarification proposed

{{talkarchive}}

BLP, ethnicity, gender

{{main|WT:BLP#Include "ethnicity, gender," to match all other guidelines}}

{{details|WT:BLP#Relevancy for ethnicity, gender, religion and sexuality (WP:EGRS)}}

Resolve arguments about differences between guidelines. Add "ethnicity, gender," to BLP, matching all other guidelines.

To avoid repeating myself ad infinitum:

  • All categorization is required to be both notable and relevant.
  • Certain quibblers have noted that ethnicity and gender are not specifically listed in WP:BLP.
  • WP:BLP is a "policy", while Wikipedia:Categorization, Wikipedia:Categorization of people (WP:COP), Wikipedia:Category names, WP:EGRS, and Wikipedia:Overcategorization (especially WP:OC#EGRS) are "guidelines".
  • Certain quibblers argue that policy trumps guidelines for these special cases.
  • Thus, (non-notable or irrelevant) ethnicity and gender might be allowed for living people, but removed for the dead, undead, or incorporeal.
  • This is difficult to enforce or implement (and was certainly never the intent of the policy).

{{quotation|

Category names do not carry disclaimers or modifiers, so the case for each category must be made clear by the article text and its reliable sources. Categories regarding {{hilite|ethnicity, gender,}} religious beliefs, and sexual orientation should not be used unless the subject has publicly self-identified with the {{hilite|information}} in question; and {{hilite|this information is}} relevant to their notable activities or public life, according to reliable published sources.


...

These principles apply equally to lists, navigation templates, and/or {{tl|infobox}} statements that are based on {{hilite|ethnicity, gender,}} religious beliefs, sexual orientation, or suggest that persons have a poor reputation.

}}

Please visit the Talk section above to certify the slight wording change.
--William Allen Simpson (talk) 15:16, 7 March 2011 (UTC)

Contradiction between categories and WP:OC#EGRS

I wanted to know if there is a reason we have :Category:Sportspeople by ethnicity?

It seems to be a clear contradiction between WP:OC#EGRS and the existence of this category, surely all that it achieves is encouraging the creation of subcategories without anyone showing that there is a substantial article to create.

When a person appears to be able to have upto 7 (at least) ethnicities under current wikipedia usage it seems difficult to avoid issues of WP:UNDUE.

Tetron76 (talk) 10:18, 5 April 2011 (UTC)

:A person really needs to be prominent in some sort of way within their ethnicity. We once had categories of "people by religion." So some politician who was supposedly religion X would be listed that way even if he was never seen to practice religion X and violated most of its principles. I had thought we had put a stop to this. Maybe Mohammed Ali and Islam and maybe African American, but not most people. More than one seems preposterous and pretentious on the part of the biographer/editor. The ubiquitous "they" have traced Obama's antecedents to include "Irish" but that hardly defines him and would be silly to use IMO. (Good luck! :) Student7 (talk) 15:56, 7 April 2011 (UTC)

  • The listed ethnicities are "Sami", "Romani" and "Basque" and "Catalan". The fact that :Category:Sportspeople by nationality is a subcategory seems to suggest an odd ordering of the tree. Nationality is not ethnicity. There has been a fight to get rid of the "European-American baskeball players" and such categories but we have made little progress on it. In the cases of "Romani People" they are a trans-national ethnicity, and so having this as a sub-set of :Category:Romani people by occupation does make sense.John Pack Lambert (talk) 22:21, 28 April 2011 (UTC)
  • "We once had categories of people by religion" was said above? I thing that this is a misnomer. We still have categories of people by religion, including :Category:American Roman Catholics and hundreds of others. We have dismantled many of the religion+occupation categories that cover non-notable intersects, but it is not clear that all our religion+writers cats are being limited to intersects where the religion effects the writing. J. R. R. Tolkien clearly belongs in :Category:British Roman Catholics because his Catholicism is known and notable and some even see it influencing his works in subtle ways, but I do not think he should go in :Category:Roman Catholic writers.John Pack Lambert (talk) 22:23, 1 February 2013 (UTC)

Category:Indian films by topic

Does overcategorization apply to :Category:Indian films by topic? Please advise what is the rule on this. Wiki-uk (talk) 12:47, 29 April 2011 (UTC)

:Sofar no reactions here. I have added the phrase "Note: please avoid overcategorization and overpopulation." on top of the category. My feeling is that as long as an average maximum of two or three main thema of a movie are added, it should be fine. Please note an extreme example of 200 keywords on the IMDB page for [http://www.imdb.com/title/tt0238936/keywords Devdas (2002)]. Wiki-uk (talk) 04:57, 4 May 2011 (UTC)

Category:Priory of Sion hoax

Here a note about removing lots of folks from :Category:Priory of Sion hoax.

I saw Isaac Newton (1643-1727) linked up to the :Category:Priory of Sion hoax which is dealt with in Priory of Sion (hoax) (c:a 1920-1956). As far as I can conclude, the life span of Isaac Newton preclude him from either partaking or debunking the hoax, although he was a very skillful debunker of mint counterfeits in his time. I think it is wrong to link up real individuals mentioned in a category reflecting a:

  • less-than-notable
  • fiction story
  • because it links from a fictional universe to the real universe.

I removed the vast majority of guys mentioned in the hoax, but who didn't partake in the controversy around it (1953 and forth). But I don't know how this refers to Wikipedia:Overcategorization — I just have a gut feeling that it is an overcategorization de luxe. Rursus dixit. (mbork3!) 10:15, 17 May 2011 (UTC)

:I guess maybe WP:OC#TRIVIA. More? Rursus dixit. (mbork3!) 10:22, 17 May 2011 (UTC)

Non-defining vs. trivial clarification proposed

The first entry in this guideline is titled "non-defining or trivial characteristic". I think this is unnecessarily confusing, because if the history of CFD has come to any broad conclusions (and there are few), it has affirmed that a "defining" characteristic is not the same as saying that the characteristic is "non-trivial". "Defining" means something significantly more central to the thing than simply stating it is "non-trivial", "interesting", or "notable". We run into this confusion quite regularly at CFD, and this page may well be the source of the confusion. We haven't been able to reach a consensus on what "defining" means exactly, and I don't want to attempt yet again to craft a definition, but what I am proposing is to separate this first entry into two separate ones: "Non-defining characteristics" and "trivial characteristics", since they are not at all the same type of overcategorization. Any objections, or other thoughts on this? Good Ol’factory (talk) 23:51, 5 September 2011 (UTC)

:With no comments in favour or against, I'll go forward with this small change. But please still raise the issue here if you wish to discuss it. Good Ol’factory (talk) 01:56, 12 September 2011 (UTC)

:: No problems with the change. Here's a definition of "defining" to get the ball rolling: A characteristic of a subject that is "defining" is one that reliable, secondary sources commonly and consistently define, in prose, the subject as having. For example: "Subject is an adjective noun ..." or "Subject, an adjective noun, ..."; here, subject to being used both commonly and consistently, each of adjective and noun may be deemed "defining". Uniplex (talk) 07:11, 12 September 2011 (UTC)

::: I think that's a pretty good start. Maybe this could be included as an example of what would constitute a defining characteristic. The main problems in the past with definitions is trying to find a definition that is exhaustive, but maybe the key is to define it non-exhaustively using several examples. I think what you have set out is a great example that could be used. I've already added two examples of "rules of thumb"; maybe this should be included as well? Good Ol’factory (talk) 20:40, 27 September 2011 (UTC)

::::Yes, that was my thought too. Perhaps it can slot in as an "including" bullet point before the existing two "excluding" ones? Maybe the final words need a tweak: "may be deemed" -> "are likely to be"? Maybe the grammatical terms would be more accessible if replaced with real examples? Please go ahead making any such changes as you see fit. Uniplex (talk) 05:34, 28 September 2011 (UTC)

:::::I agree that this one should be mentioned before the other two, since this one is positive and the other two are negative or exclusionary. For each bullet point we include, we could also include examples gleaned from CFD consensus on what is vs. what is not defining. I'll give this some more thought and try to incorporate your suggestions. Good Ol’factory (talk) 06:30, 28 September 2011 (UTC)

::::::In fact, I've added some of these words and an example at WP:CAT; feel free to BRD if it's not right. Uniplex (talk) 09:58, 7 October 2011 (UTC)

Eponymous categories

A very recent [http://en.wikipedia.org/wiki/index.php?title=Wikipedia:Overcategorization&diff=454018447&oldid=453230301 change] to section Wikipedia:Overcategorization#Small with no potential for growth added without discussion:{{quotation|Note that this cannot apply to eponymous categories: one should not create entire schemes of categorization which themselves only contain a few articles each. For instance, it would be inappropriate to create eponymous categories for each individual in :Category:Alternative rock musicians each containing only one or two articles themselves and apply this rule about a larger scheme.}}

I do not believe this is correct. Such a change would contribute to massive pollution of CfD, dealing with millions of these existing categories. Bad idea. Bad process.

This is a fairly common and obvious scheme to consolidate categories, and keep things manageable. Once there are 2 or 3 articles on a topic, it's much better to put them into 1 eponymous category, and maintain it as a subcategory of the relevant topics. Moreover, it violates the assumptions of the section, that there be no potential for growth. Current TV series come to mind. One season wonders might have no eponymous category, but once there are 2 seasons (main article and 2 season "list of episodes" articles, there SHOULD be a category.
--William Allen Simpson (talk) 08:51, 8 October 2011 (UTC)

Proposed replacement:

{{quotation|Eponymous categories should not be created with only one article, until after the second (or third) article has also been created. For instance, it would be inappropriate to create eponymous categories for every TV series in :Category:Categories named after American television series, each containing only one article themselves. However, multiple articles are best grouped by subject, with the subject in such group categories maintained in a single place for consistency.}}--William Allen Simpson (talk) 09:03, 8 October 2011 (UTC)

::Two articles? Once there are two articles about a topic, it should be maintained in a category? That's pretty small and would result in a flood of thousands of categories that only contain "[TV SHOW]" and "[LIST OF TV SHOW EPISODES]"--how would that be helpful in navigation? —Justin (koavf)TCM☯ 12:42, 8 October 2011 (UTC)

:Unless you have consensus to remove the bit about being part of a scheme, none of this is going to work. Because people will always come around and say that "Categories named after American television series" is a scheme, and that such epo categories should be allowed. --Kbdank71 19:37, 12 October 2011 (UTC)

  • Support the present version. I have followed cfd closely for some years and the only editor who has confused 'Categories named after' with 'part of an established scheme' is User:William Allen Simpson. There has been for many years a presumption against eponymous categories, and particularly against sprawling eponymous categories with no clear inclusion criteria. As I have observed many times, :Category:Categories named after American television series is a 'category of categories' and as such is quite different from (say) :Category:Albums by artist, an established sub-categorisation scheme. Occuli (talk) 00:04, 25 October 2011 (UTC)
  • Keep as is We do not want eponymous categories. They are useful for topics like Abraham Lincoln, but in general it is best to avoid putting categories in categories with the same names. Pop musicians tend to be in so many categories as it is, having every one in a category with the name of the article is a really bad idea.John Pack Lambert (talk) 22:27, 1 February 2013 (UTC)