Talk:Mix'n'match/Catalogues
Changing
editI am changing the page to use my new {{MnM}} template. This lets us do things like set a status, specify a catalog, and generate pre-filled scraper links. I started with the films section, please chime in! --Magnus Manske (talk) 11:28, 6 November 2019 (UTC)
- Why did you removed AarhusWiki, Ribewiki and KoldingWiki from the list? --Trade (talk) 15:36, 6 November 2019 (UTC)
- It wasn't removed, there was a formatting error and it didn't show, it's corrected now. --Adam Harangozó (talk) 20:08, 28 November 2019 (UTC)
new catalog
editI am trying to add new catalog to the list. Where is this described. I have a CSV file with URLs and IDs. From an official source (CMS in US). — The preceding unsigned comment was added by EncycloABC (talk) 15:40, 4 December 2019 (UTC)
Scraping
edit@Adam Harangozó: Can you scrape Sarvavijnanakosam (Malayalam Encyclopaedia) under Encyclopedias (general)|Encyclopedias (general), it uses Mediawiki. I am not too familiar with the tool. Thanks. Gotitbro (talk) 09:04, 15 January 2020 (UTC)
- @Gotitbro: Unfortunately I don't know how to create scrapers either, but maybe @Magnus Manske: or @Gerwoman: can help. --Adam Harangozó (talk) 12:47, 28 January 2020 (UTC)
- Hi, the scraper didn't seem to respond for this site, perhaps because this url doesn't work http://web-edition.sarvavijnanakosam.gov.in/Special:AllPages but only this one http://web-edition.sarvavijnanakosam.gov.in/index.php?title=Special:AllPages
- Anyway, I tried another wiki (https://simpsonswiki.com/wiki/) and I get warnings from the api :
<br /> <b>Notice</b>: Trying to get property 'query' of non-object in <b>/data/project/mix-n-match/autoscrape.inc</b> on line <b>322</b><br /> <br /> <b>Notice</b>: Trying to get property 'allpages' of non-object in <b>/data/project/mix-n-match/autoscrape.inc</b> on line <b>322</b><br /> <br /> <b>Warning</b>: Invalid argument supplied for foreach() in <b>/data/project/mix-n-match/autoscrape.inc</b> on line <b>322</b><br /> <br /> <b>Notice</b>: Undefined offset: 0 in <b>/data/project/mix-n-match/autoscrape.inc</b> on line <b>284</b><br /> {"status":"OK","data":{"html":"","log":["0 (AutoScrapeLevelMediaWiki): Reset [{\"mode\":\"mediawiki\",\"url\":\"https:\\\/\\\/simpsonswiki.com\\\/wiki\\\/Special:AllPages\",\"pos\":0,\"apfrom\":\"\"}]","0 (AutoScrapeLevelMediaWiki): Reset [{\"mode\":\"mediawiki\",\"url\":\"https:\\\/\\\/simpsonswiki.com\\\/wiki\\\/Special:AllPages\",\"pos\":0,\"apfrom\":\"\"}]","Found 0 entries."],"results":[],"last_url":""}}
- So I think only devs can answer that. Eru (talk) 18:16, 29 January 2020 (UTC)
Connected catalogues
edit@Magnus Manske: When adding new catalogues, should we use a new column for noting if a site refers to another external ID which could be scraped by the auxiliary matcher? For example [1] lists the GND number at the bottom. Would this help? — The preceding unsigned comment was added by Adam Harangozó (talk) 21:41, 30 January 2020 (UTC)
Can't figure out scraping
editHi, I'd like to add ITIS as a database to Mix'n'match, but I think scraping it is a bit beyond my technical know-how. If anyone has the knowledge and time to do so, that would be great. Thanks, Enwebb (talk) 19:12, 13 May 2020 (UTC)
Statues Vanderkrogt from static to auto-scraped catalog?
editHello fellow Mixers & Matchers! Would someone (TM) - perhaps Jean-Fred? - be able to help with the following? Statues Vanderkrogt is an excellent catalog of public art in MnM. I imported it once statically myself; but it's outdated and has grown quite a bit over the years. Scraping is beyond my own know-how too, and I was wondering if someone could either transform the current one to an auto-scraped one, or delete the current one after creating a new (auto-scraped) edition?
This website does have some peculiarities, as it operates over two domain names; but the identifiers from one domain name (vanderkrogt.net) are universally applicable. Spinster (talk) 13:51, 17 June 2023 (UTC)
NNP
editI've tried reading the instructions but there's too much of a learning curve - not just the terminology, but there's too many concepts with which I'm unfamiliar.
Should the Newman Numismatic Portal be incorporated into these lists? Thanks. DS (talk) 20:40, 8 September 2023 (UTC)
ORCID Public data file
edit@Magnus Manske: I'm surprised that the ORCID public data file at https://support.orcid.org/hc/en-us/articles/360006897394-How-do-I-get-the-public-data-file is not already in the list of catalogues. I'm aware that it's not complete (since subjects can set info to private), but it seems worthwhile. Stuartyeates (talk) 22:39, 17 February 2024 (UTC)