speculum - A simple, straightforward Arch Linux mirror list optimizerGentoo linux updates notifierSimple list...
Traveling through the asteriod belt?
How can animals be objects of ethics without being subjects as well?
How much mayhem could I cause as a sentient fish?
Intern applicant asking for compensation equivalent to that of permanent employee
Use two 8s and two 3s to make the number 24
When can a QA tester start his job?
What does it mean for a caliber to be flat shooting?
What is the difference between rolling more dice versus fewer dice?
Why avoid shared user accounts?
Can you tell from a blurry photo if focus was too close or too far?
SET NOCOUNT Error in handling SQL call after upgrade
Is boss over stepping boundary/micromanaging?
Eww, those bytes are gross
Why do neural networks need so many training examples to perform?
What would the chemical name be for C13H8Cl3NO
Bash Script Function Return True-False
It took me a lot of time to make this, pls like. (YouTube Comments #1)
Cat is tipping over bed-side lamps during the night
What's a good word to describe a public place that looks like it wouldn't be rough?
Why did Democrats in the Senate oppose the Born-Alive Abortion Survivors Protection Act (2019 S.130)?
A title for a history book
How would an AI self awareness kill switch work?
Porting Linux to another platform requirements
Cookies - Should the toggles be on?
speculum - A simple, straightforward Arch Linux mirror list optimizer
Gentoo linux updates notifierSimple list comprehensionEncryption using a mirror fieldReddit mirror encryption challengeSimple Hangman in Pythonlinux rearange field and sort by columnPython 3.6.1 Linux - Blackjack gameSmart Mirror utilising python API'sSimple Bank API scriptFind duplicate files in Linux
$begingroup$
After having had a look at reflector's code base I decided to write a new, more lightweight mirror list optimizer from scratch: speculum.
The script queries the Arch Linux mirror list JSON endpoint and performs filtering, sorting and limiting of mirrors according to the user's input.
Any feedback is welcome.
#! /usr/bin/env python3
#
# speculum - An Arch Linux mirror list updater.
#
# Copyright (C) 2019 Richard Neumann <mail at richard dash neumann period de>
#
# This program is free software: you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation, either version 3 of the License, or
# (at your option) any later version.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
# GNU General Public License for more details.
#
# You should have received a copy of the GNU General Public License
# along with this program. If not, see <http://www.gnu.org/licenses/>.
#
##############################################################################
"""Yet another Arch Linux mirrorlist optimizer."""
from __future__ import annotations
from argparse import ArgumentParser, Namespace
from datetime import datetime, timedelta
from enum import Enum
from json import load
from logging import INFO, basicConfig, getLogger
from os import linesep
from pathlib import Path
from re import error, compile, Pattern # pylint: disable=W0622
from sys import exit, stderr # pylint: disable=W0622
from typing import Callable, FrozenSet, Generator, Iterable, NamedTuple, Tuple
from urllib.request import urlopen
from urllib.parse import urlparse, ParseResult
MIRRORS_URL = 'https://www.archlinux.org/mirrors/status/json/'
DATE_FORMAT = '%Y-%m-%dT%H:%M:%S%z'
REPO_PATH = '$repo/os/$arch'
LOG_FORMAT = '[%(levelname)s] %(name)s: %(message)s'
LOGGER = getLogger(__file__)
def strings(string: str) -> filter:
"""Splits strings by comma."""
return filter(None, map(lambda s: s.strip().lower(), string.split(',')))
def stringset(string: str) -> FrozenSet[str]:
"""Returns a tuple of strings form a comma separated list."""
return frozenset(strings(string))
def hours(string: str) -> timedelta:
"""Returns a timedelta of the respective
amount of hours from a string.
"""
return timedelta(hours=int(string))
def regex(string: str) -> Pattern:
"""Returns a regular expression."""
try:
return compile(string)
except error:
raise ValueError(str(error))
def sorting(string: str) -> Tuple[Sorting]:
"""Returns a tuple of sorting options
from comma-separated string values.
"""
return tuple(Sorting.from_string(string))
def posint(string: str) -> int:
"""Returns a positive integer."""
integer = int(string)
if integer > 0:
return integer
raise ValueError('Integer must be greater than zero.')
def get_json() -> dict:
"""Returns the mirrors from the respective URL."""
with urlopen(MIRRORS_URL) as response:
return load(response)
def get_mirrors() -> Generator[Mirror]:
"""Yields the respective mirrors."""
for json in get_json()['urls']:
yield Mirror.from_json(json)
def get_sorting_key(order: Tuple[Sorting]) -> Callable:
"""Returns a key function to sort mirrors."""
now = datetime.now()
def key(mirror):
return mirror.get_sorting_key(order, now)
return key
def limit(mirrors: Iterable[Mirror], maximum: int) -> Generator[Mirror]:
"""Limit the amount of mirrors."""
for count, mirror in enumerate(mirrors, start=1):
if maximum is not None and count > maximum:
break
yield mirror
def get_args() -> Namespace:
"""Returns the parsed arguments."""
parser = ArgumentParser(description=__doc__)
parser.add_argument(
'--sort', '-s', type=sorting, default=None, metavar='sorting',
help='sort by the respective properties')
parser.add_argument(
'--reverse', '-r', action='store_true', help='sort in reversed order')
parser.add_argument(
'--countries', '-c', type=stringset, default=None, metavar='countries',
help='match mirrors of these countries')
parser.add_argument(
'--protocols', '-p', type=stringset, default=None, metavar='protocols',
help='match mirrors that use one of the specified protocols')
parser.add_argument(
'--max-age', '-a', type=hours, default=None, metavar='max_age',
help='match mirrors that use one of the specified protocols')
parser.add_argument(
'--regex-incl', '-i', type=regex, default=None, metavar='regex_incl',
help='match mirrors that match the regular expression')
parser.add_argument(
'--regex-excl', '-x', type=regex, default=None, metavar='regex_excl',
help='exclude mirrors that match the regular expression')
parser.add_argument(
'--limit', '-l', type=posint, default=None, metavar='file',
help='limit output to this amount of results')
parser.add_argument(
'--output', '-o', type=Path, default=None, metavar='file',
help='write the output to the specified file instead of stdout')
return parser.parse_args()
def dump_mirrors(mirrors: Iterable[Mirror], path: Path) -> int:
"""Dumps the mirrors to the given path."""
mirrorlist = linesep.join(mirror.mirrorlist_record for mirror in mirrors)
try:
with path.open('w') as file:
file.write(mirrorlist)
except PermissionError as permission_error:
LOGGER.error(permission_error)
return 1
return 0
def print_mirrors(mirrors: Iterable[Mirror]) -> int:
"""Prints the mirrors to STDOUT."""
for mirror in mirrors:
try:
print(mirror.mirrorlist_record, flush=True)
except BrokenPipeError:
stderr.close()
return 0
return 0
def main() -> int:
"""Filters and sorts the mirrors."""
basicConfig(level=INFO, format=LOG_FORMAT)
args = get_args()
mirrors = get_mirrors()
filters = Filter(
args.countries, args.protocols, args.max_age, args.regex_incl,
args.regex_excl)
mirrors = filter(filters.match, mirrors)
key = get_sorting_key(args.sort)
mirrors = sorted(mirrors, key=key, reverse=args.reverse)
mirrors = limit(mirrors, args.limit)
mirrors = tuple(mirrors)
if not mirrors and args.limit != 0:
LOGGER.error('No mirrors found.')
return 1
if args.limit is not None and len(mirrors) < args.limit:
LOGGER.warning('Filter yielded less mirrors than specified limit.')
if args.output:
return dump_mirrors(mirrors, args.output)
return print_mirrors(mirrors)
class Sorting(Enum):
"""Sorting options."""
AGE = 'age'
RATE = 'rate'
COUNTRY = 'country'
SCORE = 'score'
DELAY = 'delay'
@classmethod
def from_string(cls, string: str) -> Generator[Sorting]:
"""Returns a tuple of sortings from the respective string."""
for option in strings(string):
yield cls(option)
class Duration(NamedTuple):
"""Represents the duration data on a mirror."""
average: float
stddev: float
@property
def sorting_key(self) -> Tuple[float]:
"""Returns a sorting key."""
average = float('inf') if self.average is None else self.average
stddev = float('inf') if self.stddev is None else self.stddev
return (average, stddev)
class Country(NamedTuple):
"""Represents country information."""
name: str
code: str
def match(self, string: str) -> bool:
"""Matches a country description."""
return string.lower() in {self.name.lower(), self.code.lower()}
@property
def sorting_key(self) -> Tuple[str]:
"""Returns a sorting key."""
name = '~' if self.name is None else self.name
code = '~' if self.code is None else self.code
return (name, code)
class Mirror(NamedTuple):
"""Represents information about a mirror."""
url: ParseResult
last_sync: datetime
completion: float
delay: int
duration: Duration
score: float
active: bool
country: Country
isos: bool
ipv4: bool
ipv6: bool
details: ParseResult
@classmethod
def from_json(cls, json: dict) -> Mirror:
"""Returns a new mirror from a JSON-ish dict."""
url = urlparse(json['url'])
last_sync = json['last_sync']
if last_sync is not None:
last_sync = datetime.strptime(last_sync, DATE_FORMAT).replace(
tzinfo=None)
duration_avg = json['duration_avg']
duration_stddev = json['duration_stddev']
duration = Duration(duration_avg, duration_stddev)
country = json['country']
country_code = json['country_code']
country = Country(country, country_code)
details = urlparse(json['details'])
return cls(
url, last_sync, json['completion_pct'], json['delay'], duration,
json['score'], json['active'], country, json['isos'], json['ipv4'],
json['ipv6'], details)
@property
def mirrorlist_url(self) -> ParseResult:
"""Returns a mirror list URL."""
scheme, netloc, path, params, query, fragment = self.url
if not path.endswith('/'):
path += '/'
return ParseResult(
scheme, netloc, path + REPO_PATH, params, query, fragment)
@property
def mirrorlist_record(self) -> str:
"""Returns a mirror list record."""
return f'Server = {self.mirrorlist_url.geturl()}'
def get_sorting_key(self, order: Tuple[Sorting], now: datetime) -> Tuple:
"""Returns a tuple of the soring keys in the desired order."""
if not order:
return ()
key = []
for option in order:
if option == Sorting.AGE:
if self.last_sync is None:
key.append(now - datetime.fromtimestamp(0))
else:
key.append(now - self.last_sync)
elif option == Sorting.RATE:
key.append(self.duration.sorting_key)
elif option == Sorting.COUNTRY:
key.append(self.country.sorting_key)
elif option == Sorting.SCORE:
key.append(float('inf') if self.score is None else self.score)
elif option == Sorting.DELAY:
key.append(float('inf') if self.delay is None else self.delay)
else:
raise ValueError(f'Invalid sorting option: {option}.')
return tuple(key)
class Filter(NamedTuple):
"""Represents a set of mirror filtering options."""
countries: FrozenSet[str]
protocols: FrozenSet[str]
max_age: timedelta
regex_incl: Pattern
regex_excl: Pattern
def match(self, mirror: Mirror) -> bool:
"""Matches the mirror."""
if self.countries is not None:
if not any(mirror.country.match(c) for c in self.countries):
return False
if self.protocols is not None:
if mirror.url.scheme.lower() not in self.protocols:
return False
if self.max_age is not None:
if mirror.last_sync + self.max_age < datetime.now():
return False
if self.regex_incl is not None:
if not self.regex_incl.fullmatch(mirror.url.geturl()):
return False
if self.regex_excl is not None:
if self.regex_excl.fullmatch(mirror.url.geturl()):
return False
return True
if __name__ == '__main__':
try:
exit(main())
except KeyboardInterrupt:
LOGGER.error('Aborted by user.')
exit(1)
Python version: 3.7
python python-3.x
$endgroup$
add a comment |
$begingroup$
After having had a look at reflector's code base I decided to write a new, more lightweight mirror list optimizer from scratch: speculum.
The script queries the Arch Linux mirror list JSON endpoint and performs filtering, sorting and limiting of mirrors according to the user's input.
Any feedback is welcome.
#! /usr/bin/env python3
#
# speculum - An Arch Linux mirror list updater.
#
# Copyright (C) 2019 Richard Neumann <mail at richard dash neumann period de>
#
# This program is free software: you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation, either version 3 of the License, or
# (at your option) any later version.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
# GNU General Public License for more details.
#
# You should have received a copy of the GNU General Public License
# along with this program. If not, see <http://www.gnu.org/licenses/>.
#
##############################################################################
"""Yet another Arch Linux mirrorlist optimizer."""
from __future__ import annotations
from argparse import ArgumentParser, Namespace
from datetime import datetime, timedelta
from enum import Enum
from json import load
from logging import INFO, basicConfig, getLogger
from os import linesep
from pathlib import Path
from re import error, compile, Pattern # pylint: disable=W0622
from sys import exit, stderr # pylint: disable=W0622
from typing import Callable, FrozenSet, Generator, Iterable, NamedTuple, Tuple
from urllib.request import urlopen
from urllib.parse import urlparse, ParseResult
MIRRORS_URL = 'https://www.archlinux.org/mirrors/status/json/'
DATE_FORMAT = '%Y-%m-%dT%H:%M:%S%z'
REPO_PATH = '$repo/os/$arch'
LOG_FORMAT = '[%(levelname)s] %(name)s: %(message)s'
LOGGER = getLogger(__file__)
def strings(string: str) -> filter:
"""Splits strings by comma."""
return filter(None, map(lambda s: s.strip().lower(), string.split(',')))
def stringset(string: str) -> FrozenSet[str]:
"""Returns a tuple of strings form a comma separated list."""
return frozenset(strings(string))
def hours(string: str) -> timedelta:
"""Returns a timedelta of the respective
amount of hours from a string.
"""
return timedelta(hours=int(string))
def regex(string: str) -> Pattern:
"""Returns a regular expression."""
try:
return compile(string)
except error:
raise ValueError(str(error))
def sorting(string: str) -> Tuple[Sorting]:
"""Returns a tuple of sorting options
from comma-separated string values.
"""
return tuple(Sorting.from_string(string))
def posint(string: str) -> int:
"""Returns a positive integer."""
integer = int(string)
if integer > 0:
return integer
raise ValueError('Integer must be greater than zero.')
def get_json() -> dict:
"""Returns the mirrors from the respective URL."""
with urlopen(MIRRORS_URL) as response:
return load(response)
def get_mirrors() -> Generator[Mirror]:
"""Yields the respective mirrors."""
for json in get_json()['urls']:
yield Mirror.from_json(json)
def get_sorting_key(order: Tuple[Sorting]) -> Callable:
"""Returns a key function to sort mirrors."""
now = datetime.now()
def key(mirror):
return mirror.get_sorting_key(order, now)
return key
def limit(mirrors: Iterable[Mirror], maximum: int) -> Generator[Mirror]:
"""Limit the amount of mirrors."""
for count, mirror in enumerate(mirrors, start=1):
if maximum is not None and count > maximum:
break
yield mirror
def get_args() -> Namespace:
"""Returns the parsed arguments."""
parser = ArgumentParser(description=__doc__)
parser.add_argument(
'--sort', '-s', type=sorting, default=None, metavar='sorting',
help='sort by the respective properties')
parser.add_argument(
'--reverse', '-r', action='store_true', help='sort in reversed order')
parser.add_argument(
'--countries', '-c', type=stringset, default=None, metavar='countries',
help='match mirrors of these countries')
parser.add_argument(
'--protocols', '-p', type=stringset, default=None, metavar='protocols',
help='match mirrors that use one of the specified protocols')
parser.add_argument(
'--max-age', '-a', type=hours, default=None, metavar='max_age',
help='match mirrors that use one of the specified protocols')
parser.add_argument(
'--regex-incl', '-i', type=regex, default=None, metavar='regex_incl',
help='match mirrors that match the regular expression')
parser.add_argument(
'--regex-excl', '-x', type=regex, default=None, metavar='regex_excl',
help='exclude mirrors that match the regular expression')
parser.add_argument(
'--limit', '-l', type=posint, default=None, metavar='file',
help='limit output to this amount of results')
parser.add_argument(
'--output', '-o', type=Path, default=None, metavar='file',
help='write the output to the specified file instead of stdout')
return parser.parse_args()
def dump_mirrors(mirrors: Iterable[Mirror], path: Path) -> int:
"""Dumps the mirrors to the given path."""
mirrorlist = linesep.join(mirror.mirrorlist_record for mirror in mirrors)
try:
with path.open('w') as file:
file.write(mirrorlist)
except PermissionError as permission_error:
LOGGER.error(permission_error)
return 1
return 0
def print_mirrors(mirrors: Iterable[Mirror]) -> int:
"""Prints the mirrors to STDOUT."""
for mirror in mirrors:
try:
print(mirror.mirrorlist_record, flush=True)
except BrokenPipeError:
stderr.close()
return 0
return 0
def main() -> int:
"""Filters and sorts the mirrors."""
basicConfig(level=INFO, format=LOG_FORMAT)
args = get_args()
mirrors = get_mirrors()
filters = Filter(
args.countries, args.protocols, args.max_age, args.regex_incl,
args.regex_excl)
mirrors = filter(filters.match, mirrors)
key = get_sorting_key(args.sort)
mirrors = sorted(mirrors, key=key, reverse=args.reverse)
mirrors = limit(mirrors, args.limit)
mirrors = tuple(mirrors)
if not mirrors and args.limit != 0:
LOGGER.error('No mirrors found.')
return 1
if args.limit is not None and len(mirrors) < args.limit:
LOGGER.warning('Filter yielded less mirrors than specified limit.')
if args.output:
return dump_mirrors(mirrors, args.output)
return print_mirrors(mirrors)
class Sorting(Enum):
"""Sorting options."""
AGE = 'age'
RATE = 'rate'
COUNTRY = 'country'
SCORE = 'score'
DELAY = 'delay'
@classmethod
def from_string(cls, string: str) -> Generator[Sorting]:
"""Returns a tuple of sortings from the respective string."""
for option in strings(string):
yield cls(option)
class Duration(NamedTuple):
"""Represents the duration data on a mirror."""
average: float
stddev: float
@property
def sorting_key(self) -> Tuple[float]:
"""Returns a sorting key."""
average = float('inf') if self.average is None else self.average
stddev = float('inf') if self.stddev is None else self.stddev
return (average, stddev)
class Country(NamedTuple):
"""Represents country information."""
name: str
code: str
def match(self, string: str) -> bool:
"""Matches a country description."""
return string.lower() in {self.name.lower(), self.code.lower()}
@property
def sorting_key(self) -> Tuple[str]:
"""Returns a sorting key."""
name = '~' if self.name is None else self.name
code = '~' if self.code is None else self.code
return (name, code)
class Mirror(NamedTuple):
"""Represents information about a mirror."""
url: ParseResult
last_sync: datetime
completion: float
delay: int
duration: Duration
score: float
active: bool
country: Country
isos: bool
ipv4: bool
ipv6: bool
details: ParseResult
@classmethod
def from_json(cls, json: dict) -> Mirror:
"""Returns a new mirror from a JSON-ish dict."""
url = urlparse(json['url'])
last_sync = json['last_sync']
if last_sync is not None:
last_sync = datetime.strptime(last_sync, DATE_FORMAT).replace(
tzinfo=None)
duration_avg = json['duration_avg']
duration_stddev = json['duration_stddev']
duration = Duration(duration_avg, duration_stddev)
country = json['country']
country_code = json['country_code']
country = Country(country, country_code)
details = urlparse(json['details'])
return cls(
url, last_sync, json['completion_pct'], json['delay'], duration,
json['score'], json['active'], country, json['isos'], json['ipv4'],
json['ipv6'], details)
@property
def mirrorlist_url(self) -> ParseResult:
"""Returns a mirror list URL."""
scheme, netloc, path, params, query, fragment = self.url
if not path.endswith('/'):
path += '/'
return ParseResult(
scheme, netloc, path + REPO_PATH, params, query, fragment)
@property
def mirrorlist_record(self) -> str:
"""Returns a mirror list record."""
return f'Server = {self.mirrorlist_url.geturl()}'
def get_sorting_key(self, order: Tuple[Sorting], now: datetime) -> Tuple:
"""Returns a tuple of the soring keys in the desired order."""
if not order:
return ()
key = []
for option in order:
if option == Sorting.AGE:
if self.last_sync is None:
key.append(now - datetime.fromtimestamp(0))
else:
key.append(now - self.last_sync)
elif option == Sorting.RATE:
key.append(self.duration.sorting_key)
elif option == Sorting.COUNTRY:
key.append(self.country.sorting_key)
elif option == Sorting.SCORE:
key.append(float('inf') if self.score is None else self.score)
elif option == Sorting.DELAY:
key.append(float('inf') if self.delay is None else self.delay)
else:
raise ValueError(f'Invalid sorting option: {option}.')
return tuple(key)
class Filter(NamedTuple):
"""Represents a set of mirror filtering options."""
countries: FrozenSet[str]
protocols: FrozenSet[str]
max_age: timedelta
regex_incl: Pattern
regex_excl: Pattern
def match(self, mirror: Mirror) -> bool:
"""Matches the mirror."""
if self.countries is not None:
if not any(mirror.country.match(c) for c in self.countries):
return False
if self.protocols is not None:
if mirror.url.scheme.lower() not in self.protocols:
return False
if self.max_age is not None:
if mirror.last_sync + self.max_age < datetime.now():
return False
if self.regex_incl is not None:
if not self.regex_incl.fullmatch(mirror.url.geturl()):
return False
if self.regex_excl is not None:
if self.regex_excl.fullmatch(mirror.url.geturl()):
return False
return True
if __name__ == '__main__':
try:
exit(main())
except KeyboardInterrupt:
LOGGER.error('Aborted by user.')
exit(1)
Python version: 3.7
python python-3.x
$endgroup$
1
$begingroup$
Which Python 3 version is this supposed to run on? With my Python 3.6 I get an error when doingfrom __future__ import annotations
andfrom re import Pattern
.
$endgroup$
– Graipher
1 hour ago
add a comment |
$begingroup$
After having had a look at reflector's code base I decided to write a new, more lightweight mirror list optimizer from scratch: speculum.
The script queries the Arch Linux mirror list JSON endpoint and performs filtering, sorting and limiting of mirrors according to the user's input.
Any feedback is welcome.
#! /usr/bin/env python3
#
# speculum - An Arch Linux mirror list updater.
#
# Copyright (C) 2019 Richard Neumann <mail at richard dash neumann period de>
#
# This program is free software: you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation, either version 3 of the License, or
# (at your option) any later version.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
# GNU General Public License for more details.
#
# You should have received a copy of the GNU General Public License
# along with this program. If not, see <http://www.gnu.org/licenses/>.
#
##############################################################################
"""Yet another Arch Linux mirrorlist optimizer."""
from __future__ import annotations
from argparse import ArgumentParser, Namespace
from datetime import datetime, timedelta
from enum import Enum
from json import load
from logging import INFO, basicConfig, getLogger
from os import linesep
from pathlib import Path
from re import error, compile, Pattern # pylint: disable=W0622
from sys import exit, stderr # pylint: disable=W0622
from typing import Callable, FrozenSet, Generator, Iterable, NamedTuple, Tuple
from urllib.request import urlopen
from urllib.parse import urlparse, ParseResult
MIRRORS_URL = 'https://www.archlinux.org/mirrors/status/json/'
DATE_FORMAT = '%Y-%m-%dT%H:%M:%S%z'
REPO_PATH = '$repo/os/$arch'
LOG_FORMAT = '[%(levelname)s] %(name)s: %(message)s'
LOGGER = getLogger(__file__)
def strings(string: str) -> filter:
"""Splits strings by comma."""
return filter(None, map(lambda s: s.strip().lower(), string.split(',')))
def stringset(string: str) -> FrozenSet[str]:
"""Returns a tuple of strings form a comma separated list."""
return frozenset(strings(string))
def hours(string: str) -> timedelta:
"""Returns a timedelta of the respective
amount of hours from a string.
"""
return timedelta(hours=int(string))
def regex(string: str) -> Pattern:
"""Returns a regular expression."""
try:
return compile(string)
except error:
raise ValueError(str(error))
def sorting(string: str) -> Tuple[Sorting]:
"""Returns a tuple of sorting options
from comma-separated string values.
"""
return tuple(Sorting.from_string(string))
def posint(string: str) -> int:
"""Returns a positive integer."""
integer = int(string)
if integer > 0:
return integer
raise ValueError('Integer must be greater than zero.')
def get_json() -> dict:
"""Returns the mirrors from the respective URL."""
with urlopen(MIRRORS_URL) as response:
return load(response)
def get_mirrors() -> Generator[Mirror]:
"""Yields the respective mirrors."""
for json in get_json()['urls']:
yield Mirror.from_json(json)
def get_sorting_key(order: Tuple[Sorting]) -> Callable:
"""Returns a key function to sort mirrors."""
now = datetime.now()
def key(mirror):
return mirror.get_sorting_key(order, now)
return key
def limit(mirrors: Iterable[Mirror], maximum: int) -> Generator[Mirror]:
"""Limit the amount of mirrors."""
for count, mirror in enumerate(mirrors, start=1):
if maximum is not None and count > maximum:
break
yield mirror
def get_args() -> Namespace:
"""Returns the parsed arguments."""
parser = ArgumentParser(description=__doc__)
parser.add_argument(
'--sort', '-s', type=sorting, default=None, metavar='sorting',
help='sort by the respective properties')
parser.add_argument(
'--reverse', '-r', action='store_true', help='sort in reversed order')
parser.add_argument(
'--countries', '-c', type=stringset, default=None, metavar='countries',
help='match mirrors of these countries')
parser.add_argument(
'--protocols', '-p', type=stringset, default=None, metavar='protocols',
help='match mirrors that use one of the specified protocols')
parser.add_argument(
'--max-age', '-a', type=hours, default=None, metavar='max_age',
help='match mirrors that use one of the specified protocols')
parser.add_argument(
'--regex-incl', '-i', type=regex, default=None, metavar='regex_incl',
help='match mirrors that match the regular expression')
parser.add_argument(
'--regex-excl', '-x', type=regex, default=None, metavar='regex_excl',
help='exclude mirrors that match the regular expression')
parser.add_argument(
'--limit', '-l', type=posint, default=None, metavar='file',
help='limit output to this amount of results')
parser.add_argument(
'--output', '-o', type=Path, default=None, metavar='file',
help='write the output to the specified file instead of stdout')
return parser.parse_args()
def dump_mirrors(mirrors: Iterable[Mirror], path: Path) -> int:
"""Dumps the mirrors to the given path."""
mirrorlist = linesep.join(mirror.mirrorlist_record for mirror in mirrors)
try:
with path.open('w') as file:
file.write(mirrorlist)
except PermissionError as permission_error:
LOGGER.error(permission_error)
return 1
return 0
def print_mirrors(mirrors: Iterable[Mirror]) -> int:
"""Prints the mirrors to STDOUT."""
for mirror in mirrors:
try:
print(mirror.mirrorlist_record, flush=True)
except BrokenPipeError:
stderr.close()
return 0
return 0
def main() -> int:
"""Filters and sorts the mirrors."""
basicConfig(level=INFO, format=LOG_FORMAT)
args = get_args()
mirrors = get_mirrors()
filters = Filter(
args.countries, args.protocols, args.max_age, args.regex_incl,
args.regex_excl)
mirrors = filter(filters.match, mirrors)
key = get_sorting_key(args.sort)
mirrors = sorted(mirrors, key=key, reverse=args.reverse)
mirrors = limit(mirrors, args.limit)
mirrors = tuple(mirrors)
if not mirrors and args.limit != 0:
LOGGER.error('No mirrors found.')
return 1
if args.limit is not None and len(mirrors) < args.limit:
LOGGER.warning('Filter yielded less mirrors than specified limit.')
if args.output:
return dump_mirrors(mirrors, args.output)
return print_mirrors(mirrors)
class Sorting(Enum):
"""Sorting options."""
AGE = 'age'
RATE = 'rate'
COUNTRY = 'country'
SCORE = 'score'
DELAY = 'delay'
@classmethod
def from_string(cls, string: str) -> Generator[Sorting]:
"""Returns a tuple of sortings from the respective string."""
for option in strings(string):
yield cls(option)
class Duration(NamedTuple):
"""Represents the duration data on a mirror."""
average: float
stddev: float
@property
def sorting_key(self) -> Tuple[float]:
"""Returns a sorting key."""
average = float('inf') if self.average is None else self.average
stddev = float('inf') if self.stddev is None else self.stddev
return (average, stddev)
class Country(NamedTuple):
"""Represents country information."""
name: str
code: str
def match(self, string: str) -> bool:
"""Matches a country description."""
return string.lower() in {self.name.lower(), self.code.lower()}
@property
def sorting_key(self) -> Tuple[str]:
"""Returns a sorting key."""
name = '~' if self.name is None else self.name
code = '~' if self.code is None else self.code
return (name, code)
class Mirror(NamedTuple):
"""Represents information about a mirror."""
url: ParseResult
last_sync: datetime
completion: float
delay: int
duration: Duration
score: float
active: bool
country: Country
isos: bool
ipv4: bool
ipv6: bool
details: ParseResult
@classmethod
def from_json(cls, json: dict) -> Mirror:
"""Returns a new mirror from a JSON-ish dict."""
url = urlparse(json['url'])
last_sync = json['last_sync']
if last_sync is not None:
last_sync = datetime.strptime(last_sync, DATE_FORMAT).replace(
tzinfo=None)
duration_avg = json['duration_avg']
duration_stddev = json['duration_stddev']
duration = Duration(duration_avg, duration_stddev)
country = json['country']
country_code = json['country_code']
country = Country(country, country_code)
details = urlparse(json['details'])
return cls(
url, last_sync, json['completion_pct'], json['delay'], duration,
json['score'], json['active'], country, json['isos'], json['ipv4'],
json['ipv6'], details)
@property
def mirrorlist_url(self) -> ParseResult:
"""Returns a mirror list URL."""
scheme, netloc, path, params, query, fragment = self.url
if not path.endswith('/'):
path += '/'
return ParseResult(
scheme, netloc, path + REPO_PATH, params, query, fragment)
@property
def mirrorlist_record(self) -> str:
"""Returns a mirror list record."""
return f'Server = {self.mirrorlist_url.geturl()}'
def get_sorting_key(self, order: Tuple[Sorting], now: datetime) -> Tuple:
"""Returns a tuple of the soring keys in the desired order."""
if not order:
return ()
key = []
for option in order:
if option == Sorting.AGE:
if self.last_sync is None:
key.append(now - datetime.fromtimestamp(0))
else:
key.append(now - self.last_sync)
elif option == Sorting.RATE:
key.append(self.duration.sorting_key)
elif option == Sorting.COUNTRY:
key.append(self.country.sorting_key)
elif option == Sorting.SCORE:
key.append(float('inf') if self.score is None else self.score)
elif option == Sorting.DELAY:
key.append(float('inf') if self.delay is None else self.delay)
else:
raise ValueError(f'Invalid sorting option: {option}.')
return tuple(key)
class Filter(NamedTuple):
"""Represents a set of mirror filtering options."""
countries: FrozenSet[str]
protocols: FrozenSet[str]
max_age: timedelta
regex_incl: Pattern
regex_excl: Pattern
def match(self, mirror: Mirror) -> bool:
"""Matches the mirror."""
if self.countries is not None:
if not any(mirror.country.match(c) for c in self.countries):
return False
if self.protocols is not None:
if mirror.url.scheme.lower() not in self.protocols:
return False
if self.max_age is not None:
if mirror.last_sync + self.max_age < datetime.now():
return False
if self.regex_incl is not None:
if not self.regex_incl.fullmatch(mirror.url.geturl()):
return False
if self.regex_excl is not None:
if self.regex_excl.fullmatch(mirror.url.geturl()):
return False
return True
if __name__ == '__main__':
try:
exit(main())
except KeyboardInterrupt:
LOGGER.error('Aborted by user.')
exit(1)
Python version: 3.7
python python-3.x
$endgroup$
After having had a look at reflector's code base I decided to write a new, more lightweight mirror list optimizer from scratch: speculum.
The script queries the Arch Linux mirror list JSON endpoint and performs filtering, sorting and limiting of mirrors according to the user's input.
Any feedback is welcome.
#! /usr/bin/env python3
#
# speculum - An Arch Linux mirror list updater.
#
# Copyright (C) 2019 Richard Neumann <mail at richard dash neumann period de>
#
# This program is free software: you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation, either version 3 of the License, or
# (at your option) any later version.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
# GNU General Public License for more details.
#
# You should have received a copy of the GNU General Public License
# along with this program. If not, see <http://www.gnu.org/licenses/>.
#
##############################################################################
"""Yet another Arch Linux mirrorlist optimizer."""
from __future__ import annotations
from argparse import ArgumentParser, Namespace
from datetime import datetime, timedelta
from enum import Enum
from json import load
from logging import INFO, basicConfig, getLogger
from os import linesep
from pathlib import Path
from re import error, compile, Pattern # pylint: disable=W0622
from sys import exit, stderr # pylint: disable=W0622
from typing import Callable, FrozenSet, Generator, Iterable, NamedTuple, Tuple
from urllib.request import urlopen
from urllib.parse import urlparse, ParseResult
MIRRORS_URL = 'https://www.archlinux.org/mirrors/status/json/'
DATE_FORMAT = '%Y-%m-%dT%H:%M:%S%z'
REPO_PATH = '$repo/os/$arch'
LOG_FORMAT = '[%(levelname)s] %(name)s: %(message)s'
LOGGER = getLogger(__file__)
def strings(string: str) -> filter:
"""Splits strings by comma."""
return filter(None, map(lambda s: s.strip().lower(), string.split(',')))
def stringset(string: str) -> FrozenSet[str]:
"""Returns a tuple of strings form a comma separated list."""
return frozenset(strings(string))
def hours(string: str) -> timedelta:
"""Returns a timedelta of the respective
amount of hours from a string.
"""
return timedelta(hours=int(string))
def regex(string: str) -> Pattern:
"""Returns a regular expression."""
try:
return compile(string)
except error:
raise ValueError(str(error))
def sorting(string: str) -> Tuple[Sorting]:
"""Returns a tuple of sorting options
from comma-separated string values.
"""
return tuple(Sorting.from_string(string))
def posint(string: str) -> int:
"""Returns a positive integer."""
integer = int(string)
if integer > 0:
return integer
raise ValueError('Integer must be greater than zero.')
def get_json() -> dict:
"""Returns the mirrors from the respective URL."""
with urlopen(MIRRORS_URL) as response:
return load(response)
def get_mirrors() -> Generator[Mirror]:
"""Yields the respective mirrors."""
for json in get_json()['urls']:
yield Mirror.from_json(json)
def get_sorting_key(order: Tuple[Sorting]) -> Callable:
"""Returns a key function to sort mirrors."""
now = datetime.now()
def key(mirror):
return mirror.get_sorting_key(order, now)
return key
def limit(mirrors: Iterable[Mirror], maximum: int) -> Generator[Mirror]:
"""Limit the amount of mirrors."""
for count, mirror in enumerate(mirrors, start=1):
if maximum is not None and count > maximum:
break
yield mirror
def get_args() -> Namespace:
"""Returns the parsed arguments."""
parser = ArgumentParser(description=__doc__)
parser.add_argument(
'--sort', '-s', type=sorting, default=None, metavar='sorting',
help='sort by the respective properties')
parser.add_argument(
'--reverse', '-r', action='store_true', help='sort in reversed order')
parser.add_argument(
'--countries', '-c', type=stringset, default=None, metavar='countries',
help='match mirrors of these countries')
parser.add_argument(
'--protocols', '-p', type=stringset, default=None, metavar='protocols',
help='match mirrors that use one of the specified protocols')
parser.add_argument(
'--max-age', '-a', type=hours, default=None, metavar='max_age',
help='match mirrors that use one of the specified protocols')
parser.add_argument(
'--regex-incl', '-i', type=regex, default=None, metavar='regex_incl',
help='match mirrors that match the regular expression')
parser.add_argument(
'--regex-excl', '-x', type=regex, default=None, metavar='regex_excl',
help='exclude mirrors that match the regular expression')
parser.add_argument(
'--limit', '-l', type=posint, default=None, metavar='file',
help='limit output to this amount of results')
parser.add_argument(
'--output', '-o', type=Path, default=None, metavar='file',
help='write the output to the specified file instead of stdout')
return parser.parse_args()
def dump_mirrors(mirrors: Iterable[Mirror], path: Path) -> int:
"""Dumps the mirrors to the given path."""
mirrorlist = linesep.join(mirror.mirrorlist_record for mirror in mirrors)
try:
with path.open('w') as file:
file.write(mirrorlist)
except PermissionError as permission_error:
LOGGER.error(permission_error)
return 1
return 0
def print_mirrors(mirrors: Iterable[Mirror]) -> int:
"""Prints the mirrors to STDOUT."""
for mirror in mirrors:
try:
print(mirror.mirrorlist_record, flush=True)
except BrokenPipeError:
stderr.close()
return 0
return 0
def main() -> int:
"""Filters and sorts the mirrors."""
basicConfig(level=INFO, format=LOG_FORMAT)
args = get_args()
mirrors = get_mirrors()
filters = Filter(
args.countries, args.protocols, args.max_age, args.regex_incl,
args.regex_excl)
mirrors = filter(filters.match, mirrors)
key = get_sorting_key(args.sort)
mirrors = sorted(mirrors, key=key, reverse=args.reverse)
mirrors = limit(mirrors, args.limit)
mirrors = tuple(mirrors)
if not mirrors and args.limit != 0:
LOGGER.error('No mirrors found.')
return 1
if args.limit is not None and len(mirrors) < args.limit:
LOGGER.warning('Filter yielded less mirrors than specified limit.')
if args.output:
return dump_mirrors(mirrors, args.output)
return print_mirrors(mirrors)
class Sorting(Enum):
"""Sorting options."""
AGE = 'age'
RATE = 'rate'
COUNTRY = 'country'
SCORE = 'score'
DELAY = 'delay'
@classmethod
def from_string(cls, string: str) -> Generator[Sorting]:
"""Returns a tuple of sortings from the respective string."""
for option in strings(string):
yield cls(option)
class Duration(NamedTuple):
"""Represents the duration data on a mirror."""
average: float
stddev: float
@property
def sorting_key(self) -> Tuple[float]:
"""Returns a sorting key."""
average = float('inf') if self.average is None else self.average
stddev = float('inf') if self.stddev is None else self.stddev
return (average, stddev)
class Country(NamedTuple):
"""Represents country information."""
name: str
code: str
def match(self, string: str) -> bool:
"""Matches a country description."""
return string.lower() in {self.name.lower(), self.code.lower()}
@property
def sorting_key(self) -> Tuple[str]:
"""Returns a sorting key."""
name = '~' if self.name is None else self.name
code = '~' if self.code is None else self.code
return (name, code)
class Mirror(NamedTuple):
"""Represents information about a mirror."""
url: ParseResult
last_sync: datetime
completion: float
delay: int
duration: Duration
score: float
active: bool
country: Country
isos: bool
ipv4: bool
ipv6: bool
details: ParseResult
@classmethod
def from_json(cls, json: dict) -> Mirror:
"""Returns a new mirror from a JSON-ish dict."""
url = urlparse(json['url'])
last_sync = json['last_sync']
if last_sync is not None:
last_sync = datetime.strptime(last_sync, DATE_FORMAT).replace(
tzinfo=None)
duration_avg = json['duration_avg']
duration_stddev = json['duration_stddev']
duration = Duration(duration_avg, duration_stddev)
country = json['country']
country_code = json['country_code']
country = Country(country, country_code)
details = urlparse(json['details'])
return cls(
url, last_sync, json['completion_pct'], json['delay'], duration,
json['score'], json['active'], country, json['isos'], json['ipv4'],
json['ipv6'], details)
@property
def mirrorlist_url(self) -> ParseResult:
"""Returns a mirror list URL."""
scheme, netloc, path, params, query, fragment = self.url
if not path.endswith('/'):
path += '/'
return ParseResult(
scheme, netloc, path + REPO_PATH, params, query, fragment)
@property
def mirrorlist_record(self) -> str:
"""Returns a mirror list record."""
return f'Server = {self.mirrorlist_url.geturl()}'
def get_sorting_key(self, order: Tuple[Sorting], now: datetime) -> Tuple:
"""Returns a tuple of the soring keys in the desired order."""
if not order:
return ()
key = []
for option in order:
if option == Sorting.AGE:
if self.last_sync is None:
key.append(now - datetime.fromtimestamp(0))
else:
key.append(now - self.last_sync)
elif option == Sorting.RATE:
key.append(self.duration.sorting_key)
elif option == Sorting.COUNTRY:
key.append(self.country.sorting_key)
elif option == Sorting.SCORE:
key.append(float('inf') if self.score is None else self.score)
elif option == Sorting.DELAY:
key.append(float('inf') if self.delay is None else self.delay)
else:
raise ValueError(f'Invalid sorting option: {option}.')
return tuple(key)
class Filter(NamedTuple):
"""Represents a set of mirror filtering options."""
countries: FrozenSet[str]
protocols: FrozenSet[str]
max_age: timedelta
regex_incl: Pattern
regex_excl: Pattern
def match(self, mirror: Mirror) -> bool:
"""Matches the mirror."""
if self.countries is not None:
if not any(mirror.country.match(c) for c in self.countries):
return False
if self.protocols is not None:
if mirror.url.scheme.lower() not in self.protocols:
return False
if self.max_age is not None:
if mirror.last_sync + self.max_age < datetime.now():
return False
if self.regex_incl is not None:
if not self.regex_incl.fullmatch(mirror.url.geturl()):
return False
if self.regex_excl is not None:
if self.regex_excl.fullmatch(mirror.url.geturl()):
return False
return True
if __name__ == '__main__':
try:
exit(main())
except KeyboardInterrupt:
LOGGER.error('Aborted by user.')
exit(1)
Python version: 3.7
python python-3.x
python python-3.x
edited 1 hour ago
Richard Neumann
asked 1 hour ago
Richard NeumannRichard Neumann
1,911724
1,911724
1
$begingroup$
Which Python 3 version is this supposed to run on? With my Python 3.6 I get an error when doingfrom __future__ import annotations
andfrom re import Pattern
.
$endgroup$
– Graipher
1 hour ago
add a comment |
1
$begingroup$
Which Python 3 version is this supposed to run on? With my Python 3.6 I get an error when doingfrom __future__ import annotations
andfrom re import Pattern
.
$endgroup$
– Graipher
1 hour ago
1
1
$begingroup$
Which Python 3 version is this supposed to run on? With my Python 3.6 I get an error when doing
from __future__ import annotations
and from re import Pattern
.$endgroup$
– Graipher
1 hour ago
$begingroup$
Which Python 3 version is this supposed to run on? With my Python 3.6 I get an error when doing
from __future__ import annotations
and from re import Pattern
.$endgroup$
– Graipher
1 hour ago
add a comment |
1 Answer
1
active
oldest
votes
$begingroup$
The classic Python file structure is this:
import this
class Foo:
def methods(self):
pass
def function():
pass
def main():
pass
if __name__ == "__main__":
main()
While you do have all of those elements, by putting the classes all the way at the end you had me quite confused.
In your regex
function you are just printing the name of the exception, not the exception text. So you will always just get back ValueError: <class 'sre_constants.error'>
instead of a helpful description. Just use as
:
def regex(string: str) -> Pattern:
"""Returns a regular expression."""
try:
return compile(string)
except error as e:
raise ValueError(str(e))
I think you have slightly over-engineered this. Instead I would use a simple pandas.DataFrame
, which can easily be filtered and sorted. I am going to forgo the command line interface and hardcode the values, you seem to have that part down.
(tbc)
import pandas as pd
from datetime import datetime
sort = "age,country"
countries = "US,Germany"
max_age = 10
mirrors = get_json()
df = pd.DataFrame(mirrors['urls'])
df['age'] = datetime.now() - pd.to_datetime(df.last_sync)
$endgroup$
$begingroup$
Thanks. But regarding the style I thought it was: imports, Exceptions, functions, classes.
$endgroup$
– Richard Neumann
58 mins ago
1
$begingroup$
@RichardNeumann: I'll try to find a reference and finish the alternative solution after lunch...
$endgroup$
– Graipher
56 mins ago
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["\$", "\$"]]);
});
});
}, "mathjax-editing");
StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "196"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fcodereview.stackexchange.com%2fquestions%2f214459%2fspeculum-a-simple-straightforward-arch-linux-mirror-list-optimizer%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
The classic Python file structure is this:
import this
class Foo:
def methods(self):
pass
def function():
pass
def main():
pass
if __name__ == "__main__":
main()
While you do have all of those elements, by putting the classes all the way at the end you had me quite confused.
In your regex
function you are just printing the name of the exception, not the exception text. So you will always just get back ValueError: <class 'sre_constants.error'>
instead of a helpful description. Just use as
:
def regex(string: str) -> Pattern:
"""Returns a regular expression."""
try:
return compile(string)
except error as e:
raise ValueError(str(e))
I think you have slightly over-engineered this. Instead I would use a simple pandas.DataFrame
, which can easily be filtered and sorted. I am going to forgo the command line interface and hardcode the values, you seem to have that part down.
(tbc)
import pandas as pd
from datetime import datetime
sort = "age,country"
countries = "US,Germany"
max_age = 10
mirrors = get_json()
df = pd.DataFrame(mirrors['urls'])
df['age'] = datetime.now() - pd.to_datetime(df.last_sync)
$endgroup$
$begingroup$
Thanks. But regarding the style I thought it was: imports, Exceptions, functions, classes.
$endgroup$
– Richard Neumann
58 mins ago
1
$begingroup$
@RichardNeumann: I'll try to find a reference and finish the alternative solution after lunch...
$endgroup$
– Graipher
56 mins ago
add a comment |
$begingroup$
The classic Python file structure is this:
import this
class Foo:
def methods(self):
pass
def function():
pass
def main():
pass
if __name__ == "__main__":
main()
While you do have all of those elements, by putting the classes all the way at the end you had me quite confused.
In your regex
function you are just printing the name of the exception, not the exception text. So you will always just get back ValueError: <class 'sre_constants.error'>
instead of a helpful description. Just use as
:
def regex(string: str) -> Pattern:
"""Returns a regular expression."""
try:
return compile(string)
except error as e:
raise ValueError(str(e))
I think you have slightly over-engineered this. Instead I would use a simple pandas.DataFrame
, which can easily be filtered and sorted. I am going to forgo the command line interface and hardcode the values, you seem to have that part down.
(tbc)
import pandas as pd
from datetime import datetime
sort = "age,country"
countries = "US,Germany"
max_age = 10
mirrors = get_json()
df = pd.DataFrame(mirrors['urls'])
df['age'] = datetime.now() - pd.to_datetime(df.last_sync)
$endgroup$
$begingroup$
Thanks. But regarding the style I thought it was: imports, Exceptions, functions, classes.
$endgroup$
– Richard Neumann
58 mins ago
1
$begingroup$
@RichardNeumann: I'll try to find a reference and finish the alternative solution after lunch...
$endgroup$
– Graipher
56 mins ago
add a comment |
$begingroup$
The classic Python file structure is this:
import this
class Foo:
def methods(self):
pass
def function():
pass
def main():
pass
if __name__ == "__main__":
main()
While you do have all of those elements, by putting the classes all the way at the end you had me quite confused.
In your regex
function you are just printing the name of the exception, not the exception text. So you will always just get back ValueError: <class 'sre_constants.error'>
instead of a helpful description. Just use as
:
def regex(string: str) -> Pattern:
"""Returns a regular expression."""
try:
return compile(string)
except error as e:
raise ValueError(str(e))
I think you have slightly over-engineered this. Instead I would use a simple pandas.DataFrame
, which can easily be filtered and sorted. I am going to forgo the command line interface and hardcode the values, you seem to have that part down.
(tbc)
import pandas as pd
from datetime import datetime
sort = "age,country"
countries = "US,Germany"
max_age = 10
mirrors = get_json()
df = pd.DataFrame(mirrors['urls'])
df['age'] = datetime.now() - pd.to_datetime(df.last_sync)
$endgroup$
The classic Python file structure is this:
import this
class Foo:
def methods(self):
pass
def function():
pass
def main():
pass
if __name__ == "__main__":
main()
While you do have all of those elements, by putting the classes all the way at the end you had me quite confused.
In your regex
function you are just printing the name of the exception, not the exception text. So you will always just get back ValueError: <class 'sre_constants.error'>
instead of a helpful description. Just use as
:
def regex(string: str) -> Pattern:
"""Returns a regular expression."""
try:
return compile(string)
except error as e:
raise ValueError(str(e))
I think you have slightly over-engineered this. Instead I would use a simple pandas.DataFrame
, which can easily be filtered and sorted. I am going to forgo the command line interface and hardcode the values, you seem to have that part down.
(tbc)
import pandas as pd
from datetime import datetime
sort = "age,country"
countries = "US,Germany"
max_age = 10
mirrors = get_json()
df = pd.DataFrame(mirrors['urls'])
df['age'] = datetime.now() - pd.to_datetime(df.last_sync)
edited 56 mins ago
answered 1 hour ago
GraipherGraipher
25.1k53687
25.1k53687
$begingroup$
Thanks. But regarding the style I thought it was: imports, Exceptions, functions, classes.
$endgroup$
– Richard Neumann
58 mins ago
1
$begingroup$
@RichardNeumann: I'll try to find a reference and finish the alternative solution after lunch...
$endgroup$
– Graipher
56 mins ago
add a comment |
$begingroup$
Thanks. But regarding the style I thought it was: imports, Exceptions, functions, classes.
$endgroup$
– Richard Neumann
58 mins ago
1
$begingroup$
@RichardNeumann: I'll try to find a reference and finish the alternative solution after lunch...
$endgroup$
– Graipher
56 mins ago
$begingroup$
Thanks. But regarding the style I thought it was: imports, Exceptions, functions, classes.
$endgroup$
– Richard Neumann
58 mins ago
$begingroup$
Thanks. But regarding the style I thought it was: imports, Exceptions, functions, classes.
$endgroup$
– Richard Neumann
58 mins ago
1
1
$begingroup$
@RichardNeumann: I'll try to find a reference and finish the alternative solution after lunch...
$endgroup$
– Graipher
56 mins ago
$begingroup$
@RichardNeumann: I'll try to find a reference and finish the alternative solution after lunch...
$endgroup$
– Graipher
56 mins ago
add a comment |
Thanks for contributing an answer to Code Review Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fcodereview.stackexchange.com%2fquestions%2f214459%2fspeculum-a-simple-straightforward-arch-linux-mirror-list-optimizer%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
1
$begingroup$
Which Python 3 version is this supposed to run on? With my Python 3.6 I get an error when doing
from __future__ import annotations
andfrom re import Pattern
.$endgroup$
– Graipher
1 hour ago