摘要
今天在用curl調(diào)用一個solr地址查詢數(shù)據(jù)的時候報錯辽话,發(fā)現(xiàn)是因為URL地址中含有
[ 中括號
導致的入宦,既然shell的curl
命令不行考蕾,想著Python的urllib
包可以解析URL地址掖肋,那就嘗試這個
curl 執(zhí)行報錯
錯誤的情況:
root@pts/3 $ curl "http://192.168.xxx.xxx:8983/solr/core1_shard2_replica1/select?q=*:*&fq=sex:1&fq=activeTime_tdt:[NOW-20MINUTE TO NOW]&wt=json&indent=true"
curl: (3) [globbing] error: bad range specification after pos 96
如果去掉中間包含中括號的查詢條件則用curl沒有問題
root@pts/3 $ curl "http://192.168.xxx.xxx:8983/solr/core1_shard2_replica1/select?q=*:*&fq=sex:1&wt=json&indent=true"
{
"responseHeader":{
"status":0,
"QTime":8,
"params":{
"q":"*:*",
"indent":"true",
"fq":"sex:1",
"wt":"json"}},
...
python urllib
這里給出具體的例子
root@pts/3 $ python
Python 2.7.5 (default, Nov 20 2015, 02:00:19)
[GCC 4.8.5 20150623 (Red Hat 4.8.5-4)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import urllib
>>> urlstr = "http://192.168.xxx.xxx:8983/solr/core1_shard2_replica1/select?q=*:*&fq=sex:1&fq=activeTime_tdt:[NOW-20MINUTE TO NOW]&wt=json&indent=true"
>>> result = urllib.urlopen(urlstr)
>>> print(result)
<addinfourl at 36354168 whose fp = <socket._fileobject object at 0x7f437fe03c50>>
>>> print(result.read())
{
"responseHeader":{
"status":0,
"QTime":11,
"params":{
"q":"*:*",
"indent":"true",
"fq":["sex:1",
"activeTime_tdt:[NOW-20MINUTE TO NOW]"],
"wt":"json"}},
最后可以結合Python的正則表達式來等來實現(xiàn)自己個性化的需求