stem

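NASA Opens Astronaut Recruitment to the Public: A Look at the Requirements

﹥>﹥吖頭↗ submitted on 2020-02-26 07:02:46
NASA recently published a new astronaut recruitment announcement open to the entire US public. Let's look at the requirements.
First, applicants must be US citizens and hold a master's degree in a STEM field, which includes engineering, biological science, physical science, computer science, or mathematics. The master's degree requirement can also be met by any one of the following:
1. Two years of work toward a doctoral program in a science, technology, engineering, or mathematics field;
2. A completed Doctor of Medicine or Doctor of Osteopathic Medicine degree;
3. Completion of a nationally recognized test pilot school program (or enrollment that will result in completion by June 2021).
Second, applicants must have at least 2 years of related professional experience, or at least 1,000 hours of pilot-in-command time in jet aircraft.
Finally, applicants must pass NASA's long-duration spaceflight physical.
In short, the vast majority of people are not eligible to apply. Only the elite among the elite can fill in the application, and NASA will hire only the very best of them.
NASA will open the astronaut application process on March 2; interested readers can go and take a look.
Source: oschina Link: https://my.oschina.net/u/4436414/blog/3167254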

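[Mastering Feature Engineering] Study Notes (Part 2)

那年仲夏 submitted on 2020-02-06 04:31:17
[Mastering Feature Engineering] study notes, Day2 & 2.5 & D3 chapter, from p. 33
3. Text data: flattening, filtering, and chunking
3.1 Bag-of-X: turning natural text into flat vectors
3.1.1 Bag-of-words
Each feature is a word, and a feature vector consists of the number of times that word appears in each document.
3.1.2 Bag-of-n-grams
An n-gram is a sequence of n tokens. A 1-gram is a single word, also called a unigram. The larger n is, the more information can be represented, and the higher the corresponding cost.
Example: computing n-grams
>>> import pandas as pd
>>> import json
>>> from sklearn.feature_extraction.text import CountVectorizer
# Load the first 10,000 reviews
>>> f = open('data/yelp/v6/yelp_academic_dataset_review.json')
>>> js = []
>>> for i in range(10000):
...     js.append(json.loads(f.readline()))
>>> f.close()
>>> review_df = pd.DataFrame(js)
# Create feature transformers for unigrams, bigrams, and trigrams.
# By default, single-letter words are ignored, which is useful in practice
# because it removes meaningless words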
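
To make the n-gram definition above concrete, here is a minimal standard-library sketch that counts unigrams, bigrams, and trigrams in one sentence. It is not the CountVectorizer pipeline from the notes: the whitespace tokenizer, the helper names `ngrams` and `ngram_counts`, and the sample sentence are all assumptions made for illustration.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_counts(text, n):
    """Lowercase, split on whitespace (a simplifying assumption), count n-grams."""
    tokens = text.lower().split()
    return Counter(ngrams(tokens, n))


# Hypothetical sample text, not taken from the Yelp dataset.
sample = "Emma knocked on the door and Emma knocked again"

unigram_counts = ngram_counts(sample, 1)
bigram_counts = ngram_counts(sample, 2)
trigram_counts = ngram_counts(sample, 3)

for name, counts in (("unigrams", unigram_counts),
                     ("bigrams", bigram_counts),
                     ("trigrams", trigram_counts)):
    print(name, len(counts), "distinct;", "most common:", counts.most_common(1))
```

On a real corpus the number of distinct n-grams usually grows quickly with n, which is the cost the notes refer to.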

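Stemmers in NLTK

邮差的信 submitted on 2020-02-02 13:29:17
Stemmers
In English, one word is often a "variant" of another, e.g. happy => happiness, where happy is called the stem of happiness. In information retrieval systems, a common step during term normalization is stemming, i.e. stripping the inflectional endings from English words. This post mainly covers how to use the Stemmer classes in nltk.
Porter Stemmer
The most widely used stemming algorithm, of moderate complexity and based on suffix stripping, is the Porter stemming algorithm, also called the Porter Stemmer.
from nltk.stem.porter import PorterStemmer
stemmer = PorterStemmer()
plurals = ['caresses', 'flies', 'dies', 'mules', 'denied', 'died', 'agreed', 'owned', 'humbled', 'sized', 'meeting', 'stating', 'siezing', 'itemization', 'sensational', 'traditional', 'reference', 'colonizer', 'plotted']
# Stem every word in the list and print the results on one line
singles = [stemmer.stem(plural) for plural in plurals]
print(' '.join(singles))
''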
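
To illustrate what "suffix stripping" means, here is a deliberately simplified sketch in plain Python. It is not the Porter algorithm and not NLTK's implementation: the suffix list, the `toy_stem` name, and the minimum stem length are assumptions chosen only to show the idea of removing a known ending when enough of the word remains.

```python
# Hypothetical suffix list, ordered longest first so the longest match wins.
SUFFIXES = ("ational", "ization", "ness", "ing", "ed", "es", "s")


def toy_stem(word, min_stem_len=3):
    """Strip the first matching suffix if at least min_stem_len letters remain."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem_len:
            return word[:-len(suffix)]
    return word


words = ["caresses", "owned", "meeting", "happiness", "mules"]
print(" ".join(toy_stem(w) for w in words))
```

A real stemmer such as Porter's applies several ordered rule phases with conditions on the remaining stem, which is why its output differs from this toy version on many words.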

Matplotlib _ 05 Detail Settings

走远了吗. submitted on 2020-01-31 04:39:36
import pandas as pd
import matplotlib.pyplot as plt
women_degrees = pd.read_csv("C:/Users/Amber/Documents/唐宇迪-机器学习课程资料/Python库代码(4个)/3-可视化库matpltlib/percent-bachelors-degrees-women-usa.csv")
plt.plot(women_degrees['Year'],women_degrees['Biology'])
plt.show()

import pandas as pd
import matplotlib.pyplot as plt
women_degrees = pd.read_csv("C:/Users/Amber/Documents/唐宇迪-机器学习课程资料/Python库代码(4个)/3-可视化库matpltlib/percent-bachelors-degrees-women-usa.csv")
plt.plot(women_degrees['Year'],women_degrees['Biology'],c='blue',label='Women')
plt.plot(women_degrees['Year'], 100-women_degrees['Biology'], label=

Getting Tor ControlPort to work

我们两清 submitted on 2019-12-30 07:41:08
Question: I have installed Tor to run as a service on my Windows machine and I am trying to make requests in Python through the Stem package. In my torrc file I have specified ControlPort as 9051 and set a HashedControlPassword. When I run netstat, I see that Tor is running on localhost:9050 but there is nothing listening on port 9051. As a result, when I try to connect to the ControlPort in Python, Controller.from_port(port=9051) results in a [Errno 10061] No connection could be made because the
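
The netstat check described in the question can also be done from Python. The sketch below uses only the standard library and assumes Tor is expected on 127.0.0.1 with the two ports named in the question; the helper name `is_port_listening` is made up for this example. It only reports whether something accepts TCP connections on a port, not whether that something is Tor.

```python
import socket


def is_port_listening(host, port, timeout=1.0):
    """Return True if a TCP connection to (host, port) succeeds."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(timeout)
        # connect_ex returns 0 on success and an errno value otherwise,
        # e.g. "connection refused" when nothing is listening.
        return sock.connect_ex((host, port)) == 0


if __name__ == "__main__":
    # Port numbers taken from the question: SOCKS on 9050, control on 9051.
    for name, port in (("SocksPort", 9050), ("ControlPort", 9051)):
        state = "listening" if is_port_listening("127.0.0.1", port) else "not listening"
        print(name, port, state)
```

If the control port reports "not listening" while the SOCKS port is up, the running Tor instance is probably not reading the torrc that sets ControlPort, which is worth ruling out before debugging the Stem call.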

Getting Tor ControlPort to work

蹲街弑〆低调 submitted on 2019-12-30 07:41:05
Question: I have installed Tor to run as a service on my Windows machine and I am trying to make requests in Python through the Stem package. In my torrc file I have specified ControlPort as 9051 and set a HashedControlPassword. When I run netstat, I see that Tor is running on localhost:9050 but there is nothing listening on port 9051. As a result, when I try to connect to the ControlPort in Python, Controller.from_port(port=9051) results in a [Errno 10061] No connection could be made because the

Unable to use Stem and Tor in Python to change my IP address?

有些话、适合烂在心里 submitted on 2019-12-24 08:22:45
Question: I am currently trying to follow a script I found online here: Periodic Tor IP Rotation. The code I am trying to use is the following:
import requests
from stem import Signal
from stem.control import Controller

with Controller.from_port(port = 9051) as controller:
    controller.authenticate()
    controller.signal(Signal.NEWNYM)

proxies = { "http": "http://127.0.0.1:8118" }
headers = { 'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_9_1) AppleWebKit/537.73.11 (KHTML, like Gecko) Version/7
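
For debugging, it can help to see roughly what the two Stem calls in the question amount to on the wire. The sketch below talks to the control port directly with the standard library, sending the control-protocol commands AUTHENTICATE and SIGNAL NEWNYM and expecting a reply starting with 250 for each. It is an illustration, not Stem's implementation: it assumes password authentication (or an empty password when none is required), a password without quotes or backslashes, and single-line replies; the function names are made up.

```python
import socket


def send_command(stream, command):
    """Send one control-port command and return the first reply line."""
    stream.write((command + "\r\n").encode("ascii"))
    stream.flush()
    return stream.readline().decode("ascii").strip()


def request_new_identity(host="127.0.0.1", port=9051, password=""):
    """Authenticate to the control port and ask Tor for new circuits."""
    with socket.create_connection((host, port), timeout=5) as sock:
        with sock.makefile("rwb") as stream:
            # Assumption: the password contains no '"' or '\' characters.
            for command in ('AUTHENTICATE "%s"' % password, "SIGNAL NEWNYM"):
                reply = send_command(stream, command)
                if not reply.startswith("250"):
                    raise RuntimeError("%s failed: %s" % (command.split()[0], reply))
            send_command(stream, "QUIT")


if __name__ == "__main__":
    try:
        request_new_identity()
        print("NEWNYM accepted")
    except (OSError, RuntimeError) as exc:
        print("control port request failed:", exc)
```

A connection error here points at the ControlPort setting, while a non-250 reply to AUTHENTICATE points at the credentials, which separates the two failure modes that the Stem call reports together.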

Authenticating a Controller with a Tor subprocess using Stem

生来就可爱ヽ(ⅴ<●) submitted on 2019-12-22 11:30:59
Question: I am trying to launch a new Tor process (no Tor processes currently running on the system) with a 'custom' config, using Stem's launch_tor_with_config. I wrote a function that successfully generates and captures a new hashed password. I then use that new password in the config, launch Tor, and try to authenticate using the same exact passhash, and it fails. Here is the code:
from stem.process import launch_tor_with_config
from stem.control import Controller
from subprocess import Popen,

Running stem with tor gives “Process terminated: Timed out”

让人想犯罪 __ submitted on 2019-12-13 18:50:45
Question: While trying to run Stem's To Russia With Love example, I am getting the following error:
~$ python practice.py
Starting Tor:
Traceback (most recent call last):
  File "practice.py", line 49, in <module>
    init_msg_handler = print_bootstrap_lines,
  File "/usr/local/lib/python2.7/dist-packages/stem/process.py", line 266, in launch_tor_with_config
    return launch_tor(tor_cmd, args, torrc_path, completion_percent, init_msg_handler, timeout, take_ownership)
  File "/usr/local/lib/python2.7/dist-packages

Accessing youtube via stem (tor) gives unable to reach url error

孤人 submitted on 2019-12-12 03:06:08
Question: I have combined this example in Stem with pytube to measure the time it takes for me to download a YouTube video via Tor. Here's the code:
import io
import pycurl
import stem.process
from stem.util import term
import pickle
import socks # SocksiPy module
import socket
import urllib
from pytube import YouTube
import pdb
import time
import string
import random

SOCKS_PORT = 9050

# Set socks proxy and wrap the urllib module
socks.setdefaultproxy(socks.PROXY_TYPE_SOCKS5, '127.0.0.1', SOCKS_PORT)