
What are the ways to fetch page content in PHP?

Admin, 2023-09-05


Several ways to fetch web page content with PHP

Method 1: Use file_get_contents to fetch the content with a GET request.

<?php
$url = 'http://www.domain.com/?para=123';
$html = file_get_contents($url);
echo $html;
?>
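file_get_contents returns false and raises a warning when the request fails, so a slightly more defensive wrapper may be useful. The following is only a sketch: the function name fetch_page and the 5-second default timeout are illustrative assumptions, not something the original article specifies.

```php
<?php
// Hypothetical helper: fetch a URL (or any stream path) and return null on failure.
function fetch_page(string $url, int $timeoutSeconds = 5): ?string
{
    // The 'timeout' option only affects the http/https wrappers.
    $context = stream_context_create([
        'http' => ['method' => 'GET', 'timeout' => $timeoutSeconds],
    ]);

    // @ suppresses the warning; the false return value is checked instead.
    $html = @file_get_contents($url, false, $context);

    return $html === false ? null : $html;
}
```

A caller would then test the result explicitly, for example by checking whether fetch_page('http://www.domain.com/?para=123') returned null before echoing it.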

Method 2: Use file_get_contents to fetch a URL with a POST request.

<?php
$url = 'http://www.domain.com/test.php?id=123';
$data = array('foo' => 'bar');
$data = http_build_query($data);
$opts = array(
    'http' => array(
        'method'  => 'POST',
        'header'  => "Content-type: application/x-www-form-urlencoded\r\n" .
                     "Content-Length: " . strlen($data) . "\r\n",
        'content' => $data
    )
);
$ctx = stream_context_create($opts);
// The second argument (use_include_path) is a boolean, so pass false rather than ''.
$html = @file_get_contents($url, false, $ctx);
?>

To send cookie data as well, change

'header' => "Content-type: application/x-www-form-urlencoded\r\n" . "Content-Length: " . strlen($data) . "\r\n",

to

'header' => "Content-type: application/x-www-form-urlencoded\r\n" . "Content-Length: " . strlen($data) . "\r\n" . "Cookie: cookie1=c1; cookie2=c2\r\n",

Note that the entry ends with a comma, not a semicolon, because it is an element of the options array.
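Assembling the header string by hand is easy to get wrong (a missing CRLF or a stray semicolon), so it can help to build the options array in one place. The sketch below is one possible way to do that; build_post_options is a hypothetical helper name, and the cookie values are passed through unchanged on the assumption that they contain no special characters.

```php
<?php
// Hypothetical helper: build stream-context options for a form POST with optional cookies.
function build_post_options(array $fields, array $cookies = []): array
{
    $body = http_build_query($fields);

    $headers = [
        'Content-Type: application/x-www-form-urlencoded',
        'Content-Length: ' . strlen($body),
    ];

    if ($cookies !== []) {
        // Cookies are sent as "name=value" pairs separated by "; ".
        $pairs = [];
        foreach ($cookies as $name => $value) {
            $pairs[] = $name . '=' . $value;
        }
        $headers[] = 'Cookie: ' . implode('; ', $pairs);
    }

    return [
        'http' => [
            'method'  => 'POST',
            // Every header line, including the last, ends with CRLF.
            'header'  => implode("\r\n", $headers) . "\r\n",
            'content' => $body,
        ],
    ];
}
```

The result can be passed straight to stream_context_create(), for example stream_context_create(build_post_options(['foo' => 'bar'], ['cookie1' => 'c1', 'cookie2' => 'c2'])).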

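Method 3: Open the URL with fopen and fetch the content with a GET request.

<?php
// $url was not defined in the original snippet; any reachable URL works here.
$url = 'http://www.domain.com/';
$fp = fopen($url, 'r');
if ($fp === false) {
    die("Could not open $url");
}
$header = stream_get_meta_data($fp); // get the header information
$result = '';
while (!feof($fp)) {
    $result .= fgets($fp, 1024);
}
// $header is an array; the HTTP header lines are in $header['wrapper_data'].
echo "url header: " . implode("<br>", $header['wrapper_data']) . " <br>";
echo "url body: $result";
fclose($fp);
?>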
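For an http:// stream, stream_get_meta_data() exposes the raw response header lines under the 'wrapper_data' key, and the status code has to be parsed out of them. The sketch below shows one way this might be done; http_status_from_headers is a hypothetical name, and it assumes the header lines are plain strings such as "HTTP/1.1 200 OK".

```php
<?php
// Hypothetical helper: extract the status code from header lines such as
// those in stream_get_meta_data($fp)['wrapper_data'] for an http:// stream.
function http_status_from_headers(array $headerLines): ?int
{
    $status = null;
    foreach ($headerLines as $line) {
        // Redirects produce several status lines; keep the last one seen.
        if (is_string($line) && preg_match('#^HTTP/\d+(?:\.\d+)?\s+(\d{3})#', $line, $m)) {
            $status = (int) $m[1];
        }
    }
    return $status;
}
```

Checking the returned code against 200 before using the body avoids treating an error page as valid content.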


Method 4: Open the URL with fopen and fetch the content with a POST request.

<?php
$data = array('foo2' => 'bar2', 'foo3' => 'bar3');
$data = http_build_query($data);
$opts = array(
    'http' => array(
        'method'  => 'POST',
        'header'  => "Content-type: application/x-www-form-urlencoded\r\n" .
                     "Cookie: cook1=c3; cook2=c4\r\n" .
                     "Content-Length: " . strlen($data) . "\r\n",
        'content' => $data
    )
);
$context = stream_context_create($opts);
$html = fopen('http://www.test.com/zzzz.php?id=i3&id2=i4', 'rb', false, $context);
$w = fread($html, 1024); // reads at most the first 1024 bytes of the response
echo $w;
fclose($html);
?>

Method 5: Open the URL with fsockopen and fetch the complete response, header and body included, with a GET request.

<?php
function get_url($url, $cookie = false)
{
    $url = parse_url($url);
    // Array keys must be quoted; fall back to "/" when the URL has no path.
    $query = (isset($url['path']) ? $url['path'] : '/') .
             (isset($url['query']) ? '?' . $url['query'] : '');
    echo "Query:" . $query;
    $fp = fsockopen($url['host'], isset($url['port']) ? $url['port'] : 80, $errno, $errstr, 30);
    if (!$fp) {
        return false;
    }
    $request = "GET $query HTTP/1.1\r\n";
    $request .= "Host: {$url['host']}\r\n";
    $request .= "Connection: Close\r\n";
    if ($cookie) {
        $request .= "Cookie: $cookie\r\n";
    }
    $request .= "\r\n";
    fwrite($fp, $request);
    $result = '';
    while (!feof($fp)) {
        $result .= fgets($fp, 1024);
    }
    fclose($fp);
    return $result;
}

// Get the HTML part of the response, with the header stripped
function GetUrlHTML($url, $cookie = false)
{
    $rowdata = get_url($url, $cookie);
    if ($rowdata) {
        // The header ends at the first blank line (CRLF CRLF).
        $body = stristr($rowdata, "\r\n\r\n");
        $body = substr($body, 4);
        return $body;
    }
    return false;
}
?>
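Because the fsockopen approach returns the header and body as one raw string, the caller usually needs to separate them. The sketch below is one possible way to do so; parse_http_response is a hypothetical helper, and it assumes the body is not sent with chunked transfer encoding, which it does not decode.

```php
<?php
// Hypothetical helper: split a raw HTTP response into status line, headers and body.
function parse_http_response(string $raw): array
{
    // Headers and body are separated by the first blank line (CRLF CRLF).
    $parts = explode("\r\n\r\n", $raw, 2);
    $lines = explode("\r\n", $parts[0]);
    $statusLine = array_shift($lines);

    $headers = [];
    foreach ($lines as $line) {
        if (strpos($line, ':') === false) {
            continue; // skip malformed lines
        }
        [$name, $value] = explode(':', $line, 2);
        // Header names are case-insensitive, so normalise them to lower case.
        $headers[strtolower(trim($name))] = trim($value);
    }

    return [
        'status_line' => $statusLine,
        'headers'     => $headers,
        'body'        => $parts[1] ?? '',
    ];
}
```

Limiting explode() to two parts keeps any later blank lines inside the body intact.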

Method 6: Open the URL with fsockopen and fetch the complete response, header and body included, with a POST request.

<?php
function HTTP_Post($URL, $data, $cookie, $referrer = "")
{
    // parsing the given URL
    $URL_Info = parse_url($URL);

    // Building referrer
    if ($referrer == "") { // if not given, fall back to a placeholder value
        $referrer = "111";
    }

    // making string from $data
    $values = array();
    foreach ($data as $key => $value) {
        $values[] = "$key=" . urlencode($value);
    }
    $data_string = implode("&", $values);

    // Find out which port is needed - if not given use standard (=80)
    if (!isset($URL_Info["port"])) {
        $URL_Info["port"] = 80;
    }

    // building POST-request (HTTP header lines end with CRLF):
    $request  = "POST " . $URL_Info["path"] . " HTTP/1.1\r\n";
    $request .= "Host: " . $URL_Info["host"] . "\r\n";
    $request .= "Referer: $referrer\r\n";
    $request .= "Content-type: application/x-www-form-urlencoded\r\n";
    $request .= "Content-length: " . strlen($data_string) . "\r\n";
    $request .= "Connection: close\r\n";
    $request .= "Cookie: $cookie\r\n";
    $request .= "\r\n";
    $request .= $data_string; // no trailing newline, so the body matches Content-length

    $fp = fsockopen($URL_Info["host"], $URL_Info["port"]);
    if (!$fp) {
        return false;
    }
    fputs($fp, $request);
    $result = '';
    while (!feof($fp)) {
        $result .= fgets($fp, 1024);
    }
    fclose($fp);
    return $result;
}
?>

Method 7: Use the curl library. Before using it, you may need to check php.ini to confirm that the curl extension is enabled.

<?php
$ch = curl_init();
$timeout = 5;
curl_setopt($ch, CURLOPT_URL, 'http://www.domain.com/');
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
curl_setopt($ch, CURLOPT_CONNECTTIMEOUT, $timeout);
$file_contents = curl_exec($ch);
curl_close($ch);
echo $file_contents;
?>

Below are 3 ways to get a page's source code and scrape its content with PHP; choose whichever fits your needs.

1. Get the page source with file_get_contents

This is the most commonly used method. It takes only two lines of code.

Reference code:

<?php
$fh = file_get_contents('http://www.webkaka.com/');
echo $fh;
?>

2. Get the page source with fopen

This method is also widely used, although it takes a little more code.

Reference code:

<?php
$fh = fopen('http://www.webkaka.com/', 'r');
if ($fh) {
    while (!feof($fh)) {
        echo fgets($fh);
    }
    fclose($fh);
}
?>

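3. Get the page source with curl

curl is typically used when you need more control, for example when you need the response header information along with the page content, or need to set the encoding (ENCODING), the user agent (USERAGENT), and so on.

Reference code 1:

<?php
// Create a new cURL resource
$ch = curl_init();
// Set the URL and other options
curl_setopt($ch, CURLOPT_URL, "http://www.webkaka.com/");
curl_setopt($ch, CURLOPT_HEADER, false);
// Return the response as a string instead of printing it directly
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
// Fetch the URL and output the result
$data = curl_exec($ch);
echo $data;
// Close the cURL resource and free system resources
curl_close($ch);
?>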

Reference code 2:

<?php
$szUrl = "http://www.webkaka.com/";
$UserAgent = 'Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0; SLCC1; .NET CLR 2.0.50727; .NET CLR 3.0.04506; .NET CLR 3.5.21022; .NET CLR 1.0.3705; .NET CLR 1.1.4322)';
$curl = curl_init();
curl_setopt($curl, CURLOPT_URL, $szUrl);
curl_setopt($curl, CURLOPT_HEADER, 0); // 0 = do not output the header, 1 = output it
curl_setopt($curl, CURLOPT_RETURNTRANSFER, 1);
// The next two lines turn off certificate verification; avoid this in production.
curl_setopt($curl, CURLOPT_SSL_VERIFYPEER, false);
curl_setopt($curl, CURLOPT_SSL_VERIFYHOST, false);
curl_setopt($curl, CURLOPT_ENCODING, '');
curl_setopt($curl, CURLOPT_USERAGENT, $UserAgent);
curl_setopt($curl, CURLOPT_FOLLOWLOCATION, 1);
$data = curl_exec($curl);
echo $data;
// echo curl_errno($curl); // a return value of 0 means the request succeeded
curl_close($curl);
?>
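The commented-out curl_errno() call above hints at error handling: curl_exec() returns false on failure, curl_errno() gives the numeric error code, and curl_error() gives the matching message. The sketch below wraps these together; curl_fetch and the shape of the returned array are illustrative assumptions, and it requires the curl extension to be enabled.

```php
<?php
// Hypothetical helper: fetch a URL with cURL and report failures explicitly.
function curl_fetch(string $url, int $timeoutSeconds = 5): array
{
    $ch = curl_init();
    curl_setopt($ch, CURLOPT_URL, $url);
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
    curl_setopt($ch, CURLOPT_CONNECTTIMEOUT, $timeoutSeconds);
    curl_setopt($ch, CURLOPT_TIMEOUT, $timeoutSeconds);

    $body  = curl_exec($ch);   // false on failure
    $errno = curl_errno($ch);  // 0 means no transport-level error
    $error = curl_error($ch);  // human-readable message, '' when $errno is 0
    // The handle is released when $ch goes out of scope at the end of the function.

    return [
        'ok'    => $body !== false && $errno === 0,
        'body'  => $body === false ? null : $body,
        'errno' => $errno,
        'error' => $error,
    ];
}
```

A caller can then branch on the 'ok' flag and log 'errno' and 'error' when the request fails, instead of echoing an empty result.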
