现在的位置: 首页 > 程序设计> 正文
Java程序自动发布文章到百度空间
2011年07月02日 程序设计 评论数 2 ⁄ 被围观 5,999+

这几天建立了个博客,想把这个博客的内容同时导入到其他博客中去,同时,当在本博客发表一篇博文的时候,也同步传输到其他博客中去,

这样就方便多了,以后只要维护一个就ok了。

CSDN的博客开放的比较好,符合http://www.xmlrpc.com/metaWeblogApi标准,可以有开放的API来调用,容易解决。

目前遇到的问题就是导入博文到百度空间,貌似百度现在还没有做出什么开放的API来,没办法来直接调用,不过也有方法来解决,

那就是通过程序来模拟浏览器的登录过程和发表文章的过程,通过测试,搞定。。O(∩_∩)O哈哈~

下面介绍一下程序的思路和展示一下源码:

1. 基本思路就是通过程序来模拟浏览器的登录过程,本文的代码基于Java语言实现,主要利用的就是Apache的HttpClient包。

2. 大家首先可以利用浏览器的插件来捕获我们正常到百度登录过程,和发表博文过程的所有Http请求的数据,着重分析一下

表单数据的提交,就可以知道百度空间发文在后台做了那些操作和需要哪些重要的数据来验证用户的正常登录。

3. 用程序来模拟浏览器登录的最重要的处理,就是Cookie的保存和后续数据的传输,因为像百度这类需要用户输入账户

和密码来登录的系统,并且具有密码记忆功能的网站,用户的验证信息都存储在Cookie中。

4. 因为为了让百度服务器识别用户的登录,那就要在程序中记录第一次登录过程的Cookie数据,然后在以后的http请求中

将cookie数据回传给服务器,那样服务器就认为你一直登录了,不然文章是不能正确发布出去的,因为你没有cookie的话,

服务器还认为你没有登录过,会拒绝操作。

5. 用户的身份验证解决了以后,百度发表文章的时候,还要多加一步验证,就是通过访问新建博文的页面,获取一个临时的百度

token,它会与你接下来发表的博文一一对应,我猜测可能是服务器支持单用户多点登录,同时发表多篇博文吧,不过这都没

关系了,反正我们照做就行啦。

6. 文章分类的处理,我发现百度这边做的不够智能,如果你发表博文的时候,指定了一个不存在的类别,直接决绝发布,所以,还是

想个笨点的办法,每次发布博文前,先建立一个类别,如果已经有了这个类型,重新建立一次也没关系,反正也不会覆盖掉,因为百度

会拒绝建立新的类别,错误类型是类别已存在,这个善意的错误忽略就ok。。。哈哈

7. 所有这些前期工作做好了,就能正常发布文章了,下面贴出一下源码文件,大家按照自己的方式去修改吧:

1. Config.java

主要配置一下个人百度账号的用户名和密码

1
2
3
4
5
package com.carey.baidublog;   
 public class Config {    
 public static final String USERNAME = "username";    
 public static final String PASSWORD = "password";
}
package com.carey.baidublog;   
 public class Config {    
 public static final String USERNAME = "username";    
 public static final String PASSWORD = "password";
}

2. PublishBlog.java

测试程序,调用函数发布文章到百度空间

1
2
3
4
5
6
7
8
9
10
11
package com.carey.baidublog;  
public class PublistBlog {    
 public static void main(String[] args) {        
  try {            
   BDHttpClient.publishBlog("Article-Title", "Article-Content""Article-category");            
   System.out.println("成功发布一篇文章到百度空间");        
  } catch (Exception e) {            
      e.printStackTrace();        
  }    
 }
}
package com.carey.baidublog;  
public class PublistBlog {    
 public static void main(String[] args) {        
  try {            
   BDHttpClient.publishBlog("Article-Title", "Article-Content",  "Article-category");            
   System.out.println("成功发布一篇文章到百度空间");        
  } catch (Exception e) {            
      e.printStackTrace();        
  }    
 }
}

3. BDHttpClient.java

发布百度博文的实现类,主要是http请求和cookie的处理,以及获取百度令牌token

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
package com.carey.baidublog;
  
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.util.HashMap;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;  
import org.apache.commons.httpclient.Cookie;
import org.apache.commons.httpclient.HttpClient;
import org.apache.commons.httpclient.HttpMethod;
import org.apache.commons.httpclient.HttpStatus;
import org.apache.commons.httpclient.methods.ByteArrayRequestEntity;
import org.apache.commons.httpclient.methods.GetMethod;
import org.apache.commons.httpclient.methods.PostMethod;  
 
public class BDHttpClient {    
    private static final String LOGIN_URL = "<a href="https://passport.baidu.com/?login">https://passport.baidu.com/?login</a>";    
 private static final String CREATBLOG_URL = "<a href="http://hi.baidu.com/">http://hi.baidu.com/</a>" + Config.USERNAME + "/creat/blog";    
 private static final String MODIFYCATEGORY_URL = "<a href="http://hi.baidu.com/">http://hi.baidu.com/</a>" + Config.USERNAME + "/modify/category/0";    
 private static final String COMMITBLOG_URL = "<a href="http://hi.baidu.com/">http://hi.baidu.com/</a>" + Config.USERNAME + "/commit";      
 private static final int CONNECTION_TIMEOUT = 20000;      
 
 public static void publishBlog(String title, String content, String category) throws Exception {        
  // step 1, login baidu and get cookies        
  Cookie[] cookies = LoginBaidu();          
 
  // step 2, get Bdstoken        
  String bdstoken = getBdstoken(cookies);          
 
  // step 3, publish article        
  if (bdstoken != null) {            
  postBlog(cookies, bdstoken, title, content, category);        
  } else { 
  throw new Exception("bdstoken == null");        
  }    
 }      
 
 private static HttpClient getHttpClient(Cookie[] cookies) {        
  HttpClient httpClient = new HttpClient();          
  if (cookies != null) {            
   // add cookies            
   httpClient.getState().addCookies(cookies);            
   // httpClient.getParams().setParameter("http.protocol.cookie-policy",            
   // CookiePolicy.BROWSER_COMPATIBILITY);        
   }          
   return httpClient;    
  }      
  
 private static Cookie[] LoginBaidu() {        
  HttpClient httpClient = getHttpClient(null);          
  HashMap params = new HashMap();        
  params.put("username", Config.USERNAME);        
  params.put("password", Config.PASSWORD);        
  params.put("pwd", "1");          
  
  try {            
   httpPost(httpClient, LOGIN_URL, generateQueryString(params));        
  } catch (Exception e) {            
   e.printStackTrace();        
  }          
  
  return httpClient.getState().getCookies();    
 }      
 
 private static String getBdstoken(Cookie[] cookies) {        
  HttpClient httpClient = getHttpClient(cookies);          
  try {            
   String res = httpGet(httpClient, CREATBLOG_URL, null);            
   Pattern p = Pattern.compile("bdstoken=([0-9a-z]+)\W");            
   Matcher m = p.matcher(res);            
   
   if (m.find()) {                
    return m.group(1);            
   }        
  } catch (Exception e) {            
   e.printStackTrace();        
  }          
  
  return null;    
 }      
 
 private static String createCategory(Cookie[] cookies, String bdstoken, String category) throws Exception {        
  HttpClient httpClient = getHttpClient(cookies);          
  HashMap params = new HashMap();          
  // bdstoken        
  params.put("bdstoken", bdstoken);          
  
  // create category        
  params.put("ct", "2");        
  params.put("cm", "1");          
  
  // Article category, such as "Android", "Google", "默认分类"       
  params.put("spBlogCatName", category);        
  params.put("spRefURL", MODIFYCATEGORY_URL);          
  return httpPost(httpClient, COMMITBLOG_URL, generateQueryString(params));    
 }      
 
 private static String postBlog(Cookie[] cookies, String bdstoken, String title, String content, String category) throws Exception { 
  HttpClient httpClient = getHttpClient(cookies);          
  HashMap params = new HashMap();          
  
  // bdstoken        
  params.put("bdstoken", bdstoken);          
  
  // new blog        
  params.put("ct", "1");        
  params.put("cm", "1");          
  
  // add a new article        
  params.put("spBlogID", "");        
  params.put("edithid", "");        
  params.put("spBlogCatName_o", "");          
  
  // Article title        
  params.put("spBlogTitle", title);          
  
  // Article content        
  params.put("spBlogText", content);          
  
  // Article category, such as "Android", "Google", "默认分类"       
  params.put("spBlogCatName", category);          
  
  // Article view Permissions,        
  // 0 --> every one 1 --> only friend 3 --> only oneself        
  params.put("spBlogPower", "0");          
  
  // 0: forbidden comment 1: allow comment        
  params.put("spIsCmtAllow", "1");          
  
  // 0: allow share 1: forbidden share        
  params.put("spShareNotAllow", "0");          
  
  // verify code        
  params.put("spVcode", "");        
  params.put("spVerifyKey", "");          
  
  // first create category        
  createCategory(cookies, bdstoken, category);          
  return httpPost(httpClient, COMMITBLOG_URL, generateQueryString(params));    
 }      
 
 private static String generateQueryString(HashMap params) {        
  StringBuffer sb = new StringBuffer();          
  if (params != null && !params.isEmpty()) {            
   Set keys = params.keySet();              
   for (String key : keys) {                
    sb.append(key);                
    sb.append("=");                
    sb.append(params.get(key));                
    sb.append("&");            
   }              
   // remove last &            
   sb.deleteCharAt(sb.length() - 1);        
  }          
  
  return sb.toString();    
 }      
 
 private static String httpGet(HttpClient httpClient, String url, String queryString) throws Exception {       
  String responseData = null;          
  if (queryString != null && !queryString.equals("")) {    
   url += "?" + queryString;        
  }          
  
  GetMethod httpGet = new GetMethod(url);        
  httpGet.getParams().setParameter("http.socket.timeout", new Integer(CONNECTION_TIMEOUT));          
  
  try {            
   int statusCode = httpClient.executeMethod(httpGet);            
   if (statusCode != HttpStatus.SC_OK) {              
    System.err.println("HttpGet Method failed: " + httpGet.getStatusLine());            
   }            
   
   responseData = getResponseBodyAsString(httpGet);        
  } catch (Exception e) {            
   throw new Exception(e);        
  } finally {            
   httpGet.releaseConnection();       
  }    
  
  return responseData;    
 }      
 
 private static String httpPost(HttpClient httpClient, String url, String queryString) throws Exception {   
  String responseData = null;
  PostMethod httpPost = new PostMethod(url); 
  httpPost.addParameter("Content-Type", "application/x-www-form-urlencoded");
        httpPost.getParams().setParameter("http.socket.timeout", new Integer(CONNECTION_TIMEOUT));         
  if (queryString != null && !queryString.equals("")) {            
   httpPost.setRequestEntity(new ByteArrayRequestEntity(queryString.getBytes()));        
  }          
  
  try {            
   int statusCode = httpClient.executeMethod(httpPost);            
   if (statusCode != HttpStatus.SC_OK) {                
    System.err.println("HttpPost Method failed: " + httpPost.getStatusLine());            
   }            
   
   responseData = getResponseBodyAsString(httpPost);        
  } catch (Exception e) {            
   throw new Exception(e);        
  } finally {            
   httpPost.releaseConnection();        
  }          
  
  return responseData;    
 }      
 
 private static String getResponseBodyAsString(HttpMethod httpMethod) {        
  StringBuffer sb = new StringBuffer();          
  try {            
   BufferedReader br = new BufferedReader(new InputStreamReader( httpMethod.getResponseBodyAsStream()));            
   String line = br.readLine();            
   while (line != null) {                
    sb.append(line + "n");                
    line = br.readLine();            
   }        
  } catch (IOException e) {            
   e.printStackTrace();        
  }          
  
  return sb.toString();    
 }
}
package com.carey.baidublog;
  
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.util.HashMap;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;  
import org.apache.commons.httpclient.Cookie;
import org.apache.commons.httpclient.HttpClient;
import org.apache.commons.httpclient.HttpMethod;
import org.apache.commons.httpclient.HttpStatus;
import org.apache.commons.httpclient.methods.ByteArrayRequestEntity;
import org.apache.commons.httpclient.methods.GetMethod;
import org.apache.commons.httpclient.methods.PostMethod;  

public class BDHttpClient {    
    private static final String LOGIN_URL = "<a href="https://passport.baidu.com/?login">https://passport.baidu.com/?login</a>";    
 private static final String CREATBLOG_URL = "<a href="http://hi.baidu.com/">http://hi.baidu.com/</a>" + Config.USERNAME + "/creat/blog";    
 private static final String MODIFYCATEGORY_URL = "<a href="http://hi.baidu.com/">http://hi.baidu.com/</a>" + Config.USERNAME + "/modify/category/0";    
 private static final String COMMITBLOG_URL = "<a href="http://hi.baidu.com/">http://hi.baidu.com/</a>" + Config.USERNAME + "/commit";      
 private static final int CONNECTION_TIMEOUT = 20000;      
 
 public static void publishBlog(String title, String content, String category) throws Exception {        
  // step 1, login baidu and get cookies        
  Cookie[] cookies = LoginBaidu();          
 
  // step 2, get Bdstoken        
  String bdstoken = getBdstoken(cookies);          
 
  // step 3, publish article        
  if (bdstoken != null) {            
  postBlog(cookies, bdstoken, title, content, category);        
  } else { 
  throw new Exception("bdstoken == null");        
  }    
 }      
 
 private static HttpClient getHttpClient(Cookie[] cookies) {        
  HttpClient httpClient = new HttpClient();          
  if (cookies != null) {            
   // add cookies            
   httpClient.getState().addCookies(cookies);            
   // httpClient.getParams().setParameter("http.protocol.cookie-policy",            
   // CookiePolicy.BROWSER_COMPATIBILITY);        
   }          
   return httpClient;    
  }      
  
 private static Cookie[] LoginBaidu() {        
  HttpClient httpClient = getHttpClient(null);          
  HashMap params = new HashMap();        
  params.put("username", Config.USERNAME);        
  params.put("password", Config.PASSWORD);        
  params.put("pwd", "1");          
  
  try {            
   httpPost(httpClient, LOGIN_URL, generateQueryString(params));        
  } catch (Exception e) {            
   e.printStackTrace();        
  }          
  
  return httpClient.getState().getCookies();    
 }      
 
 private static String getBdstoken(Cookie[] cookies) {        
  HttpClient httpClient = getHttpClient(cookies);          
  try {            
   String res = httpGet(httpClient, CREATBLOG_URL, null);            
   Pattern p = Pattern.compile("bdstoken=([0-9a-z]+)\W");            
   Matcher m = p.matcher(res);            
   
   if (m.find()) {                
    return m.group(1);            
   }        
  } catch (Exception e) {            
   e.printStackTrace();        
  }          
  
  return null;    
 }      
 
 private static String createCategory(Cookie[] cookies, String bdstoken, String category) throws Exception {        
  HttpClient httpClient = getHttpClient(cookies);          
  HashMap params = new HashMap();          
  // bdstoken        
  params.put("bdstoken", bdstoken);          
  
  // create category        
  params.put("ct", "2");        
  params.put("cm", "1");          
  
  // Article category, such as "Android", "Google", "默认分类"       
  params.put("spBlogCatName", category);        
  params.put("spRefURL", MODIFYCATEGORY_URL);          
  return httpPost(httpClient, COMMITBLOG_URL, generateQueryString(params));    
 }      
 
 private static String postBlog(Cookie[] cookies, String bdstoken, String title, String content, String category) throws Exception { 
  HttpClient httpClient = getHttpClient(cookies);          
  HashMap params = new HashMap();          
  
  // bdstoken        
  params.put("bdstoken", bdstoken);          
  
  // new blog        
  params.put("ct", "1");        
  params.put("cm", "1");          
  
  // add a new article        
  params.put("spBlogID", "");        
  params.put("edithid", "");        
  params.put("spBlogCatName_o", "");          
  
  // Article title        
  params.put("spBlogTitle", title);          
  
  // Article content        
  params.put("spBlogText", content);          
  
  // Article category, such as "Android", "Google", "默认分类"       
  params.put("spBlogCatName", category);          
  
  // Article view Permissions,        
  // 0 --> every one 1 --> only friend 3 --> only oneself        
  params.put("spBlogPower", "0");          
  
  // 0: forbidden comment 1: allow comment        
  params.put("spIsCmtAllow", "1");          
  
  // 0: allow share 1: forbidden share        
  params.put("spShareNotAllow", "0");          
  
  // verify code        
  params.put("spVcode", "");        
  params.put("spVerifyKey", "");          
  
  // first create category        
  createCategory(cookies, bdstoken, category);          
  return httpPost(httpClient, COMMITBLOG_URL, generateQueryString(params));    
 }      
 
 private static String generateQueryString(HashMap params) {        
  StringBuffer sb = new StringBuffer();          
  if (params != null && !params.isEmpty()) {            
   Set keys = params.keySet();              
   for (String key : keys) {                
    sb.append(key);                
    sb.append("=");                
    sb.append(params.get(key));                
    sb.append("&");            
   }              
   // remove last &            
   sb.deleteCharAt(sb.length() - 1);        
  }          
  
  return sb.toString();    
 }      
 
 private static String httpGet(HttpClient httpClient, String url, String queryString) throws Exception {       
  String responseData = null;          
  if (queryString != null && !queryString.equals("")) {    
   url += "?" + queryString;        
  }          
  
  GetMethod httpGet = new GetMethod(url);        
  httpGet.getParams().setParameter("http.socket.timeout", new Integer(CONNECTION_TIMEOUT));          
  
  try {            
   int statusCode = httpClient.executeMethod(httpGet);            
   if (statusCode != HttpStatus.SC_OK) {              
    System.err.println("HttpGet Method failed: " + httpGet.getStatusLine());            
   }            
   
   responseData = getResponseBodyAsString(httpGet);        
  } catch (Exception e) {            
   throw new Exception(e);        
  } finally {            
   httpGet.releaseConnection();       
  }    
  
  return responseData;    
 }      
 
 private static String httpPost(HttpClient httpClient, String url, String queryString) throws Exception {   
  String responseData = null;
  PostMethod httpPost = new PostMethod(url); 
  httpPost.addParameter("Content-Type", "application/x-www-form-urlencoded");
        httpPost.getParams().setParameter("http.socket.timeout", new Integer(CONNECTION_TIMEOUT));         
  if (queryString != null && !queryString.equals("")) {            
   httpPost.setRequestEntity(new ByteArrayRequestEntity(queryString.getBytes()));        
  }          
  
  try {            
   int statusCode = httpClient.executeMethod(httpPost);            
   if (statusCode != HttpStatus.SC_OK) {                
    System.err.println("HttpPost Method failed: " + httpPost.getStatusLine());            
   }            
   
   responseData = getResponseBodyAsString(httpPost);        
  } catch (Exception e) {            
   throw new Exception(e);        
  } finally {            
   httpPost.releaseConnection();        
  }          
  
  return responseData;    
 }      
 
 private static String getResponseBodyAsString(HttpMethod httpMethod) {        
  StringBuffer sb = new StringBuffer();          
  try {            
   BufferedReader br = new BufferedReader(new InputStreamReader( httpMethod.getResponseBodyAsStream()));            
   String line = br.readLine();            
   while (line != null) {                
    sb.append(line + "n");                
    line = br.readLine();            
   }        
  } catch (IOException e) {            
   e.printStackTrace();        
  }          
  
  return sb.toString();    
 }
}

程序运行后会在百度空间发布一篇博文的,截图如下:

有兴趣的童鞋希望给改进更好的版本,给我反馈一下,哈哈,就当人人为我,我为人人,做个小广告!

好了,到现在为止,工作都做完了,收工喽!!!

目前有 2 条留言 其中:访客:0 条, 博主:0 条 引用: 2

    给我留言

    留言无头像?


    ×
    腾讯微博