This page is updated automatically and regularly (daily) with external text from {{{url}}}. The text is inserted between the <!--SUBSTER-abc-->...<!--SUBSTER-abc--> tags, an unlimited number of times. (This is done by DrTrigonBot.)
If this template is included on a page, DrTrigonBot modifies the page content according to the given criteria and inserts text from arbitrary sources (if they have changed). In case of problems, please leave a note on this page (in English or German, please). This template was derived from Template:Auto archiving notice and simplified.
All (plain) text sources are supported, as are HTML sources via regex and BeautifulSoup (and thus, partially, XML) and RSS sources (through online RSS2HTML conversion, or alternatively with feed2html and Universal Feed Parser on the TS). These sources may also be ZIP-compressed. Finally, pure Wikipedia sources can be used as well.
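As a rough illustration of the ZIP handling, the following Python sketch picks one file out of an in-memory archive. The names unzip_member and make_demo_archive are hypothetical, and reading the number as a 1-based position in the archive (so that True selects the first file) is an assumption, not something taken from the bot's code. Only the standard zipfile module is used.

```python
import io
import zipfile


def unzip_member(data, zip_param=True):
    """Return the bytes of one file from a ZIP archive held in memory.

    Assumption: zip_param=True means "first file", and a number n > 0
    means "n-th file" (1-based), in the order stored in the archive.
    """
    position = int(zip_param)  # True -> 1
    if position < 1:
        raise ValueError("zip_param must be True or a number bigger than 0")
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        names = archive.namelist()
        return archive.read(names[position - 1])


def make_demo_archive():
    """Build a small two-file archive, only for demonstration."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("first.txt", "alpha")
        archive.writestr("second.txt", "beta")
    return buffer.getvalue()


demo = make_demo_archive()
print(unzip_member(demo, True))  # content of first.txt
print(unzip_member(demo, 2))     # content of second.txt
```

The decompressed bytes would then be decoded and passed on to the regex step like any other text source.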
One thing to keep in mind when using external sources is copyright: the external page used as a source has to be either available under a free license, as is the case for Identi.ca and blog.wikimedia.de, or the user has to show probable cause to be the author of the content used from it. The choice has to be made carefully (e.g. avoid pages containing lots of advertising).
{{User:DrTrigonBot/Subster |url=... |regex=... |value=abc }} ... <!--SUBSTER-abc--><!--SUBSTER-abc-->
{{User:DrTrigonBot/Subster |url=... |beautifulsoup=True }} ... <!--SUBSTER-BS:body--><!--SUBSTER-BS:/-->
{{User:DrTrigonBot/Subster |simple={{xyz|...}} |value=abc }} ... <!--SUBSTER-abc--><!--SUBSTER-abc-->
where the template (e.g. xyz) has the following format
{{((}}xyz |url=... |regex=... {{))}}
Arbitrary variables (e.g. everything known from Help:Magic words#Parser functions) can be used here.
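To make the role of the tags concrete, here is a minimal Python sketch of replacing whatever sits between a pair of <!--SUBSTER-abc--> tags. The function name substitute is hypothetical and the logic is an assumption about the general idea, not the bot's actual implementation.

```python
import re


def substitute(page_text, value, new_text):
    """Replace the text between each pair of <!--SUBSTER-value--> tags."""
    tag = "<!--SUBSTER-%s-->" % value
    # Non-greedy match so that every tag pair on the page is handled separately.
    pattern = re.compile(re.escape(tag) + r".*?" + re.escape(tag), re.DOTALL)
    # A function replacement avoids backslash interpretation in new_text.
    return pattern.sub(lambda match: tag + new_text + tag, page_text)


page = "Latest: <!--SUBSTER-abc-->old<!--SUBSTER-abc--> (updated daily)"
print(substitute(page, "abc", "new"))
# prints: Latest: <!--SUBSTER-abc-->new<!--SUBSTER-abc--> (updated daily)
```

The tags themselves are kept, so the same page can be updated again on the next run.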
optional:
[...] False) or internal text of a Wikipedia page (True) as source (default: False).
[...] subst:). Can only be used in combination with wiki=True and has no effect otherwise (default: False).
[...] True or any number bigger than 0) the first (or given by number) file from the archive gets decompressed and will be used (default: False).
[...] False).
[...] [day of month] [month] [day of week]' (default: * * *).
[...] (.*)' or '(.*?)' for the part of the text to extract (the regex can be tested and confirmed with the Python Regex Tool).
optional:
('formatedlist', regex, '* [[%s]]'): The extracted text is processed again with a regular expression, and a list of all resulting matches, formatted as Wikipedia links (with [[...]]), is returned (in wiki format).
('formatedlist_frommatrix', regex, format, cols, head, check): Especially for big tables (like CSV), with the option to filter entries according to certain criteria (check).
('replacetext', '<.*?>', 'abc'): The extracted text is filtered again with a regular expression and every match is replaced; here, for example, all HTML tags it contains are replaced by 'abc'.
('chain', postprocs): Use multiple postproc functions in sequence.
[...] False).
[...] False).
Because the bot is not running continuously yet (just daily), this is the simplest way to test and check the settings and parameters of the template and to choose them properly. The tool calls the bot code directly, so it is a simulation using the real bot (the production environment).
DrTrigonBot subster simulation panel
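For experimenting offline with the regex and post-processing options described above, the following Python sketch mimics three of them ('formatedlist', 'replacetext' and 'chain'). The function names extract and apply_postproc are hypothetical, and the behaviour is an assumption derived from the descriptions on this page, not a copy of the bot's code.

```python
import re


def extract(source, regex):
    """Return the first group matched by regex, or '' if nothing matches."""
    match = re.search(regex, source, re.DOTALL)
    return match.group(1) if match else ""


def apply_postproc(text, postproc):
    """Apply one postproc tuple of the form (name, arguments...)."""
    name, *args = postproc
    if name == "formatedlist":
        regex, fmt = args
        # One output line per match; with a single group, findall yields strings.
        return "\n".join(fmt % found for found in re.findall(regex, text))
    if name == "replacetext":
        regex, replacement = args
        return re.sub(regex, replacement, text)
    if name == "chain":
        (postprocs,) = args
        for step in postprocs:
            text = apply_postproc(text, step)
        return text
    raise ValueError("unsupported postproc: %r" % (name,))


html = "<ul><li>Alpha</li><li>Beta</li></ul>"
inner = extract(html, r"<ul>(.*?)</ul>")
print(apply_postproc(inner, ("formatedlist", r"<li>(.*?)</li>", "* [[%s]]")))
print(apply_postproc("<b>x</b>", ("replacetext", "<.*?>", "abc")))
```

The first call prints one wiki list item per match, the second replaces each HTML tag by 'abc'. Results from the real bot should still be confirmed with the simulation panel.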
The bot is also able to receive mails. These are stored and used as a data source. The mails can also be viewed, in order to see what information can be extracted and how.
To access received mails, the following syntax has to be used in the url parameter:
mail://[email protected]/all
for the whole mail text (body) or '/attachment' for attachments.
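The structure of such a mail URL can be illustrated with a short Python sketch based on the standard urllib.parse module. The function name parse_mail_url and the address bot@example.org are placeholders invented for this example; how the bot itself parses the parameter is not documented here.

```python
from urllib.parse import urlsplit


def parse_mail_url(url):
    """Split a mail:// URL into (address, selector).

    selector is 'all' for the mail body or 'attachment' for attachments.
    """
    parts = urlsplit(url)
    if parts.scheme != "mail":
        raise ValueError("not a mail:// URL: %r" % url)
    selector = parts.path.lstrip("/")
    if selector not in ("all", "attachment"):
        raise ValueError("unknown selector: %r" % selector)
    # The part between '//' and the next '/' holds the mail address.
    return parts.netloc, selector


# bot@example.org is a placeholder address, not the bot's real one.
print(parse_mail_url("mail://bot@example.org/all"))
print(parse_mail_url("mail://bot@example.org/attachment"))
```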
DrTrigonBot subster mail queue: [email protected]
My apologies for not having translated this yet, but you can have a look at w:de:Benutzer:DrTrigonBot/Subster/Doku#Beispiele.
Part of the bot runs permanently (continuously, as a daemon) and reacts to certain specific edits by users.